This invention relates to a protein regulating sensitivity to the antimycotic aureobasidin and to a gene
coding for this protein, namely, a gene coding for a protein regulating aureobasidin sensitivity. The present
invention further relates to a series of uses of the protein and the gene. Furthermore, it relates to an
antibody against this protein and the use of the same.

Systemic mycoses including candidiasis have increased in recent years with the rise in immunocompromised
patients due to, for example, the extended use of immunosuppressive drugs and acquired
immunodeficiency syndrome (AIDS), and as opportunistic infections due to microbial substitution caused by
the frequent use of wide-spectrum antibacterial antibiotics. Although drugs for treating mycoses such as
amphotericin B, flucytosine and azole drugs (for example, fluconazole and miconazole) are now used to
cope with this situation, none of them achieves a satisfactory effect. Known diagnostic drugs are also
insufficient. For candidiasis in particular, although several diagnostic drugs are known (for
example, CAND-TEC for detection of Candida antigen and LABOFIT for detection of D-arabinitol), none of
them gives satisfactory results in specificity or sensitivity.

The development of remedies and diagnostic drugs for mycoses has been delayed, as described above,
because the fungi causing the mycoses are eukaryotic organisms like the host (i.e., man) and thus do not
differ greatly from man, and because knowledge of fungi, in particular pathogenic fungi, is
insufficient. It is therefore difficult to distinguish fungi from man or to kill fungi selectively.

Recently, the application of genetic engineering techniques such as antisense or PCR to the treatment
and diagnosis of mycoses has been expected. However, known genes which are applicable thereto and/or
proteins coded for by these genes are rare (PCT Pamphlet WO92/03455). Regarding pathogenic fungi, there
have been cloned in recent years an acid protease gene, which has been assumed to participate in the
pathogenicity of Candida albicans (hereinafter referred to simply as C. albicans) and Candida tropicalis
(hereinafter referred to as C. tropicalis) causing candidiasis [B. Hube et al., J. Med. Vet. Mycol., 29, 129 -
132 (1991); Japanese Patent Laid-Open No. 49476/1993; and G. Togni et al., FEBS Letters, 286, 181 - 185
(1991)], a calmodulin gene of C. albicans [S.M. Saporito et al., Gene, 106, 43 - 49 (1991)] and a glycolytic
pathway enzyme enolase gene of C. albicans [P. Sundstrom et al., J. Bacteriology, 174, 6789 - 6799
(1991)]. However, each of these genes and the proteins coded for thereby is either indistinguishable from
those of nonpathogenic fungi and of eukaryotic organisms other than fungi or, if distinguishable therefrom,
cannot serve as a definite action point for exhibiting any selective toxicity.

Aureobasidin [Japanese Patent Laid-Open No. 138296/1990, No. 22995/1991, No. 220199/1991 and
No. 279384/1993; Japanese Patent Application No. 303177/1992; J. Antibiotics, 44 (9), 919 - 924, ibid.,
44 (9), 925 - 933, ibid., 44 (11), 1187 - 1198 (1991)] is a cyclic depsipeptide obtained as a fermentation
product of the strain Aureobasidium pullulans No. R106. It is completely different in structure from other
antimycotics. As Tables 1 and 2 show, aureobasidin A, which is a typical aureobasidin compound, exerts a
potent antimycotic activity on various yeasts of the genus Candida including C. albicans, which is a
pathogenic fungus, Cryptococcus neoformans, Histoplasma capsulatum, Blastomyces dermatitidis and fungi
of the genus Aspergillus (Japanese Patent Laid-Open No. 138296/1990) but has an extremely low toxicity
in mammals. Thus this compound is expected to be useful as an antimycotic with excellent selective
toxicity.

Hereinafter, Candida, Cryptococcus and Aspergillus will be abbreviated respectively as C., Cr. and A.
Table 1

Test strain                       TIMM No.       MIC (µg/ml)
C. albicans                       [illegible]    [illegible]
C. albicans var. stellatoidea     [illegible]    ≤0.04
C. tropicalis                     0312           0.08
C. kefyr                          0298           0.16
C. parapsilosis                   0287           0.16
C. krusei                         0270           ≤0.04
C. guilliermondii                 0257           0.08
C. glabrata                       1062           ≤0.04
Cr. neoformans                    0354           0.63
Cr. terreus                       0424           0.31
Rhodotorula rubra                 0923           0.63
A. fumigatus                      0063           20
A. clavatus                       0056           0.16

Table 2

Test strain                       TIMM No.       MIC (µg/ml)
A. nidulans                       0112           0.16
A. terreus                        0120           5
Penicillium commune               1331           1.25
Trichophyton mentagrophytes       1189           10
Epidermophyton floccosum          0431           2.5
Fonsecaea pedrosoi                0482           0.31
Exophiala werneckii               1334           1.25
Cladosporium bantianum            0343           0.63
Histoplasma capsulatum            0713           0.16
Paracoccidioides brasiliensis     0880           0.31
Geotrichum candidum               0694           0.63
Blastomyces dermatitidis          0126           0.31

[Problems to be Solved by the Invention] 

Each of the conventional antimycotics of low toxicity shows only a fungistatic effect, which has
been regarded as a clinical problem. In contrast, aureobasidin has a fungicidal effect. From this point of
view, there is an urgent need to clarify the mechanism of the selective toxicity of aureobasidin to
fungi. However, this mechanism still remains unknown.

Under these circumstances, the present invention aims at finding a novel protein regulating
aureobasidin sensitivity through the clarification of the mechanism of the selective toxicity of
aureobasidin to fungi. Accordingly, the present invention aims at finding a gene coding for a protein
regulating aureobasidin sensitivity, providing a process for cloning this gene and the protein regulating
aureobasidin sensitivity which is encoded by this gene, further providing an antisense DNA and an
antisense RNA of this gene, providing a nucleic acid probe hybridizable with this gene, providing a
process for detecting this gene with the use of the nucleic acid probe, providing a process for producing
the protein regulating aureobasidin sensitivity by using this gene, and providing an antibody against the
protein regulating aureobasidin sensitivity and a process for detecting the protein regulating
aureobasidin sensitivity by using this antibody.


EP 0 644 262 A2 



[Means for Solving the Problems] 

The present invention may be summarized as follows. Namely, the first aspect of the present invention 
relates to an isolated gene coding for a protein regulating aureobasidin sensitivity, that is, a gene regulating 
aureobasidin sensitivity. The second aspect of the invention relates to a process for cloning a gene 
regulating aureobasidin sensitivity which is characterized by using the gene regulating aureobasidin 
sensitivity of the first aspect or a part thereof as a probe. The third aspect of the invention relates to a 
nucleic acid probe which is hybridizable with a gene regulating aureobasidin sensitivity and comprises a 
sequence consisting of 15 or more bases. The fourth aspect of the invention relates to an antisense DNA of 
a gene regulating aureobasidin sensitivity. The fifth aspect of the invention relates to an antisense RNA of a 
gene regulating aureobasidin sensitivity. The sixth aspect of the invention relates to a recombinant plasmid 
having a gene regulating aureobasidin sensitivity contained therein. The seventh aspect of the invention 
relates to a transformant having the above-mentioned plasmid introduced thereinto. The eighth aspect of the
invention relates to a process for producing a protein regulating aureobasidin sensitivity by using the above- 
mentioned transformant. The ninth aspect of the invention relates to an isolated protein regulating 
aureobasidin sensitivity. The tenth aspect of the invention relates to an antibody against a protein regulating 
aureobasidin sensitivity. The eleventh aspect of the invention relates to a process for detecting a protein 
regulating aureobasidin sensitivity by using the above-mentioned antibody. The twelfth aspect of the 
invention relates to a process for detecting a gene regulating aureobasidin sensitivity by the hybridization 
which is characterized by using the nucleic acid probe of the third aspect of the present invention. The 
thirteenth aspect of the invention relates to a process for screening an antimycotic by using the above- 
mentioned transformant or a protein regulating aureobasidin sensitivity. 

The present inventors have found that fungi such as Schizosaccharomyces pombe (hereinafter
referred to simply as Schizo. pombe) and Saccharomyces cerevisiae (hereinafter referred to simply as
S. cerevisiae) and, further, mammalian cells such as mouse lymphoma EL-4 cells are sensitive to
aureobasidin, as Table 3 shows.

Table 3

Test strain or cell               MIC (µg/ml)
Schizo. pombe                     0.08
S. cerevisiae                     0.31
mouse lymphoma EL-4               10
mouse lymphoma L5178Y             100
NRK-49F                           12.5



The present inventors have mutagenized a wild-type strain of Schizo. pombe or S. cerevisiae, sensitive
to aureobasidin, to thereby give resistant mutants. We have further successfully isolated a gene capable of
conferring aureobasidin resistance (a resistant gene) from these resistant mutants and another gene capable
of imparting aureobasidin sensitivity (a sensitive gene) from the corresponding sensitive cells. Furthermore,
we have disclosed the existence of a protein encoded by each of these genes. By culturing cells which
have been transformed by introducing the above-mentioned gene, we have succeeded in the expression of
this gene. Furthermore, we have successfully found a novel gene regulating aureobasidin sensitivity
in another fungus sensitive to aureobasidin by using a DNA fragment of the above-mentioned gene
as a probe. In addition, we have clarified that the gene regulating aureobasidin sensitivity is essentially
required for the growth of the cells and found that the detection of this gene, or of a protein which is a
gene product thereof with an antibody, enables the diagnosis of diseases caused by these cells, for
example, mycoses induced by fungi, and that an antisense DNA or an antisense RNA, which inhibits the
expression of the gene regulating aureobasidin sensitivity characteristic of the cells, is usable as a
remedy for diseases caused by these cells, for example, mycoses induced by fungi, thus completing the
present invention.

That is to say, the pathogenic fungi listed in Tables 1 and 2 and the fungi and mammalian cells listed in
Table 3, each having a sensitivity to aureobasidin, each carry a protein regulating aureobasidin sensitivity
and a gene coding for this protein. The term "a protein regulating aureobasidin sensitivity" as used herein
means a protein which is contained in an organism having a sensitivity to aureobasidin. This protein is
required for the expression of the sensitivity or resistance to aureobasidin. As a matter of course, a protein
having 35% or more homology with the above-mentioned protein and having a similar function is also a
member of the protein regulating aureobasidin sensitivity according to the present invention. Furthermore,
proteins obtained by modifying these proteins by genetic engineering procedures are members of the
protein regulating aureobasidin sensitivity according to the present invention. A gene regulating
aureobasidin sensitivity means a gene which codes for such a protein regulating aureobasidin sensitivity as
those described above and includes both sensitive genes and resistant genes.

The first aspect of the present invention relates to a gene regulating aureobasidin sensitivity. This gene
can be isolated in the following manner. First, aureobasidin-sensitive cells (a wild-type strain) are
mutagenized to thereby induce a resistant strain. From chromosomal DNA or cDNA of this resistant strain, a
DNA library is prepared and a gene capable of conferring resistance (a resistant gene) is cloned from this
library. Then a DNA library of a wild-type strain is prepared and a DNA molecule hybridizable with the
resistant gene is isolated from this library and cloned. Thus a sensitive gene can be isolated.

The mutagenesis is performed by, for example, treating with a chemical such as ethylmethane sulfonate
(EMS) or N-methyl-N'-nitro-N-nitrosoguanidine (MNNG) or by ultraviolet or other radiation. The cells that
have acquired the resistance can be screened by culturing the mutagenized cells in a nutritional medium
containing aureobasidin at an appropriate concentration under appropriate conditions. The resistant strain
thus obtained may vary depending on the method and conditions selected for the mutagenesis. Also,
strains differing from each other in the extent of resistance can be separated by changing the aureobasidin
concentration, or a temperature-sensitive resistant strain can be isolated by changing the temperature in the
step of screening. There are a number of mechanisms of resistance to aureobasidin. Accordingly, a number
of resistant genes can be isolated by genetically classifying these various resistant strains. In the case of a
yeast, the classification may be performed by the complementation test. Namely, resistant strains are
prepared from haploid cells. Next, diploid cells are obtained by crossing resistant strains differing in
mating type from each other. Then spores formed from these diploids are examined by tetrad analysis.
As typical examples of the genes regulating aureobasidin sensitivity (named aur) according to the
present invention, the aur1 and aur2 genes may be cited. Typical examples of the aur1 gene include the
spaur1 gene isolated from Schizo. pombe and the scaur1 gene isolated from S. cerevisiae, while typical
examples of the aur2 gene include the scaur2 gene isolated from S. cerevisiae. Now, resistant genes
(spaur1R, scaur1R and scaur2R) isolated from resistant mutants by the present inventors and sensitive genes
(spaur1S, scaur1S and scaur2S) isolated from sensitive wild-type strains will be described.

Fig. 1 shows a restriction enzyme map of the genes spaur1R and spaur1S regulating aureobasidin
sensitivity, Fig. 2 shows a restriction enzyme map of scaur1R and scaur1S and Fig. 3 shows a restriction
enzyme map of scaur2R and scaur2S.

Schizo. pombe, which is sensitive to aureobasidin, is mutagenized with EMS and a genomic library of
the resistant strain thus obtained is prepared. From this library, a DNA fragment containing a resistant gene
(spaur1R) and having the restriction enzyme map of Fig. 1 is isolated. This gene has a nucleotide sequence
represented by SEQ ID No. 1 in Sequence Listing. The amino acid sequence of a protein encoded by this
gene, which is estimated on the basis of this nucleotide sequence, is the one represented by SEQ ID No. 2
in Sequence Listing. By hybridization with the use of this resistant gene as a probe, a DNA fragment
containing a sensitive gene (spaur1S) and having the restriction enzyme map of Fig. 1 is isolated from a
sensitive strain. This gene has a nucleotide sequence represented by SEQ ID No. 3 in Sequence Listing.
The amino acid sequence of a protein encoded by this gene, which is estimated on the basis of this
nucleotide sequence, is the one represented by SEQ ID No. 4 in Sequence Listing. A comparison between
the sequences of SEQ ID No. 3 and SEQ ID No. 1 reveals that a mutation from G to T occurs at the base at
position 1053, while a comparison between the sequences of SEQ ID No. 4 and SEQ ID No. 2 reveals
that glycine at residue 240 is converted into cysteine at the amino acid level, thus giving rise to the
resistance.
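
The comparison of a sensitive and a resistant sequence described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the toy sequences below are hypothetical and are not SEQ ID No. 1 or No. 3, the codon table covers only the codons used in the example, and the helper names `point_differences` and `codon_change` are invented for this sketch.

```python
# Partial codon table: only the codons needed for the toy example below.
CODON_TABLE = {
    "ATG": "Met", "GCT": "Ala", "GGT": "Gly",
    "TGT": "Cys", "TTT": "Phe", "TAT": "Tyr",
}


def point_differences(sensitive, resistant):
    """Return (1-based position, sensitive base, resistant base) per mismatch."""
    if len(sensitive) != len(resistant):
        raise ValueError("sequences must have equal length")
    return [
        (i + 1, a, b)
        for i, (a, b) in enumerate(zip(sensitive, resistant))
        if a != b
    ]


def codon_change(sensitive, resistant, position, orf_start=1):
    """Return (residue number, sensitive residue, resistant residue) for the
    codon that contains the given 1-based nucleotide position."""
    codon_index = (position - orf_start) // 3
    start = (orf_start - 1) + codon_index * 3
    return (
        codon_index + 1,
        CODON_TABLE[sensitive[start:start + 3]],
        CODON_TABLE[resistant[start:start + 3]],
    )


# Hypothetical toy ORFs that differ by a single G-to-T substitution.
toy_sensitive = "ATGGCTGGTTTT"
toy_resistant = "ATGGCTTGTTTT"

diffs = point_differences(toy_sensitive, toy_resistant)
change = codon_change(toy_sensitive, toy_resistant, diffs[0][0])
print(diffs)   # [(7, 'G', 'T')]
print(change)  # (3, 'Gly', 'Cys')
```

In the toy case a G-to-T change in the first base of a GGT codon turns glycine into cysteine, which is the same kind of substitution as the one reported for residue 240.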

Also, S. cerevisiae, which is sensitive to aureobasidin, is mutagenized with EMS and genomic libraries
of two resistant strains thus obtained are prepared. From one of these libraries, a DNA fragment containing
a resistant gene (scaur1R) as a dominant mutant and having the restriction enzyme map of Fig. 2 is
isolated, while a DNA fragment containing a resistant gene (scaur2R) and having the restriction enzyme map
of Fig. 3 is isolated from the other library.

The nucleotide sequence of the protein-coding region of the scaur1R gene is the one
represented by SEQ ID No. 5 in Sequence Listing. The amino acid sequence of the protein encoded by this
gene, which is estimated on the basis of the above nucleotide sequence, is the one represented by SEQ ID
No. 6 in Sequence Listing. By hybridization with the use of this resistant gene scaur1R as a probe, a
DNA fragment containing a sensitive gene (scaur1S) and having the restriction enzyme map of Fig. 2 is
isolated from a sensitive strain. This gene has a nucleotide sequence represented by SEQ ID No. 7 in
Sequence Listing. The amino acid sequence of a protein encoded by this gene, which is estimated on the
basis of this nucleotide sequence, is the one represented by SEQ ID No. 8 in Sequence Listing. A
comparison between the sequences of SEQ ID No. 7 and SEQ ID No. 5 reveals that a mutation from T to A
occurs at the base at position 852, while a comparison between the sequences of SEQ ID No. 8 and
SEQ ID No. 6 reveals that phenylalanine at residue 158 is converted into tyrosine at the amino acid
level, thus giving rise to the resistance. The spaur1 gene has a 58% homology with the scaur1 gene at the
amino acid level. Thus it is obvious that they are genes coding for proteins having similar functions to each
other. When genes and proteins homologous in sequence with the spaur1 and scaur1 genes and with
the proteins encoded thereby are searched for in a data base, none having a homology of 35% or above is
detected. Accordingly, it is clear that these genes and the proteins encoded thereby are novel molecules
which have never been known hitherto.
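
To make the homology figures concrete, the following Python sketch computes a percent identity between two pre-aligned amino acid sequences and applies the 35% cutoff mentioned in the text. The text does not state how homology was calculated, so counting identical residues over aligned columns is an assumption, as are the function names and the toy sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns with identical residues.

    Columns where both sequences carry a gap are skipped; a residue
    opposite a gap counts as a mismatch (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_homology_cutoff(aligned_a, aligned_b, cutoff=35.0):
    """True when the identity reaches the cutoff (35% in the text)."""
    return percent_identity(aligned_a, aligned_b) >= cutoff


# Hypothetical, already aligned toy peptides (one-letter code).
toy_a = "MKTAYIAKQR"
toy_b = "MKSAYLAK-R"
identity = percent_identity(toy_a, toy_b)
print(identity)                             # 70.0
print(meets_homology_cutoff(toy_a, toy_b))  # True
```

Real database searches rely on alignment programs and scoring matrices; this sketch only shows the arithmetic behind a percentage such as 58% or 35%.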

By hybridization with the use of the DNA fragment of the resistant gene scaur2R as a probe, a DNA
fragment containing a sensitive gene (scaur2S) and having the restriction enzyme map of Fig. 3 is isolated
from a sensitive strain.

The nucleotide sequence of this sensitive gene is the one represented by SEQ ID No. 9 in Sequence
Listing and the amino acid sequence of the protein encoded by this gene, which is estimated on the basis
of this nucleotide sequence, is the one represented by SEQ ID No. 10 in Sequence Listing. As the result of
the homology search with the scaur2S gene and the protein encoded thereby, it has been found that
only the cystic fibrosis transmembrane conductance regulator (CFTR) of mammals shows any homology, and
that homology is as low as 31%. Moreover, the part having a high homology with CFTR is limited to the
region around the nucleotide-binding domain. It is therefore obvious that the protein encoded by the
scaur2S gene is a protein which is completely different from CFTR in function and has never been known
hitherto.

In order to prove the importance of the aur1 gene in the growth of cells, constructs for disrupting the
aur1 gene are prepared as shown in Fig. 4 and Fig. 5, in which a gene coding for orotidine-5'-phosphate
decarboxylase (ura4+ in the case of Schizo. pombe, URA3 in the case of S. cerevisiae) has been introduced
midway in the aur1 gene. When these disrupted aur1 genes are introduced into Schizo. pombe and
S. cerevisiae respectively, the cells having the disrupted aur1 genes cannot grow at all. Thus it has been
revealed that these genes and the proteins encoded thereby are essentially required for the growth of the
yeast cells.

As the above examples clearly show, a gene regulating aureobasidin sensitivity can be isolated by
using an organism having sensitivity to aureobasidin as a starting material and by carrying out the cloning
with the use of various mutagenesis methods and/or screening methods depending on the organisms or the
methods. Also, genes hybridizable with the above-mentioned genes are included in the scope of the
first aspect of the present invention. A gene regulating aureobasidin sensitivity can also be isolated by the
following method. The genomic DNA library of an organism having sensitivity to aureobasidin is integrated
into, for example, a high-expression vector of a yeast and transformed into the yeast. Then a clone having
aureobasidin resistance is selected from the transformants and DNA is recovered from this clone. Thus the
resistant gene can be obtained. As a matter of course, genes obtained by modifying some part of the gene
regulating aureobasidin sensitivity thus obtained by chemical or physical methods are included in the
scope of the first aspect of the present invention.

The second aspect of the present invention relates to a process for cloning a gene regulating
aureobasidin sensitivity which is characterized by using the gene regulating aureobasidin sensitivity of the
first aspect of the present invention or a part thereof as a probe. Namely, by screening by the hybridization
method or the polymerase chain reaction (PCR) method with the use of a part (consisting of at least 15
nucleotides) or the whole of the gene as obtained above, a gene coding for a protein having a similar
function can be isolated.

For example, a pair of primers of SEQ ID No. 11 and SEQ ID No. 12 in Sequence Listing are
synthesized on the basis of the DNA nucleotide sequence of the spaur1R gene represented by SEQ ID No.
1. Then PCR is performed by using cDNA of C. albicans, which is a pathogenic fungus, as a template with
the use of the above-mentioned primers. The PCR products are electrophoresed
on an agarose gel and stained with ethidium bromide. In Fig. 6, the lanes 1, 2 and 3 show the results
obtained by using cDNA of C. albicans, cDNA of S. cerevisiae and cDNA of Schizo. pombe as a template,
respectively. As shown in Fig. 6, a certain DNA fragment is specifically amplified.
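
The primer-pair amplification described above can be illustrated with a minimal in-silico sketch in Python. It assumes exact primer matches on a single-stranded representation of the template, which is a strong simplification of real PCR. The template and primers below are hypothetical toy sequences, not SEQ ID No. 11 or No. 12, and `amplicon` is a name invented for this sketch.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon(template, forward, reverse):
    """Return the region bounded by the two primers, or None.

    The forward primer must match the template strand as written; the
    reverse primer must match the opposite strand, so its reverse
    complement is searched downstream of the forward primer.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start < 0:
        return None
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end < 0:
        return None
    return template[start:end + len(site)]


# Hypothetical toy template and primers.
toy_template = "TTGACCATGGCTAGCTAGGATCCGTACGTTAA"
toy_forward = "ACCATGG"
toy_reverse = "CGTACGG"  # reverse complement of CCGTACG in the template

product = amplicon(toy_template, toy_forward, toy_reverse)
print(product)       # ACCATGGCTAGCTAGGATCCGTACG
print(len(product))  # 25
```

A template lacking either primer site yields `None`, which loosely corresponds to a lane without a specifically amplified band.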

By screening the genomic DNA library of C. albicans with the use of this DNA fragment as a probe, a
DNA molecule having a gene (caaur1), which has the same function as that of the spaur1 and scaur1 genes,
and having the restriction enzyme map of Fig. 7 is obtained. The nucleotide sequence of this caaur1 gene
is the one represented by SEQ ID No. 13 in Sequence Listing and the amino acid sequence of the protein
encoded by this gene, which has been estimated on the basis of the above nucleotide sequence, is the one
represented by SEQ ID No. 14 in Sequence Listing. It has a high homology with the proteins encoded by
the spaur1 and scaur1 genes.

By screening the genomic DNA library of C. albicans with the use of a DNA fragment comprising the
whole length or a part of the scaur2S gene represented by SEQ ID No. 9 in Sequence Listing as a probe, a
DNA fragment containing a gene (caaur2), which has the same function as that of the scaur2 gene, and
having the restriction enzyme map of Fig. 8 is obtained. The nucleotide sequence of a part of this caaur2
gene is the one represented by SEQ ID No. 15 in Sequence Listing and the amino acid sequence of the
region encoded by this gene, which has been estimated on the basis of this nucleotide sequence, is the
one represented by SEQ ID No. 16 in Sequence Listing. It has a high homology with the corresponding
region of the protein encoded by the scaur2 gene.

The third aspect of the present invention relates to an oligonucleotide comprising 15 or more bases
which serves as the above-mentioned nucleic acid probe and is hybridizable with the gene regulating
aureobasidin sensitivity, for example, the DNA fragment having the restriction enzyme map as shown in Fig.
1, Fig. 2 or Fig. 3. This nucleic acid probe is usable in, for example, in situ hybridization, the
identification of a tissue wherein the above-mentioned gene can be expressed, and the confirmation of the
presence of a gene or mRNA in various vital tissues. This nucleic acid probe can be prepared by ligating
the above-mentioned gene or a gene fragment to an appropriate vector, introducing it into a bacterium,
allowing it to replicate in the bacterium, extracting it from a disrupted cell suspension, cleaving with a
restriction enzyme capable of recognizing the vector-ligating site, electrophoresing and then excising from
the gel. Alternatively, this nucleic acid probe can be constructed by chemical synthesis with the use of
a DNA synthesizer or by gene amplification techniques using PCR on the basis of the nucleotide sequence of
SEQ ID Nos. 1, 3, 5, 7, 9, 13, 15 or 21 in Sequence Listing. This nucleic acid probe can be labeled with a
radioisotope or a fluorescent substance to thereby elevate the detection sensitivity in use.
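
As a rough illustration of the 15-base requirement, the Python sketch below rejects probes shorter than 15 bases and reports where a probe is perfectly complementary to a target strand. Treating "hybridizable" as exact complementarity is an assumption made for simplicity, since real hybridization tolerates mismatches depending on stringency. The target sequence and the name `probe_sites` are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def probe_sites(target, probe, min_length=15):
    """Return 0-based start positions on the target strand to which the
    probe is perfectly complementary.

    Raises ValueError for probes shorter than min_length bases.
    """
    if len(probe) < min_length:
        raise ValueError(f"probe must have at least {min_length} bases")
    site = reverse_complement(probe)
    target = target.upper()
    positions = []
    start = target.find(site)
    while start != -1:
        positions.append(start)
        start = target.find(site, start + 1)
    return positions


# Hypothetical toy target; the probe is built from a 15-base window of it.
toy_target = "GGATCCATGGCTGGTTTTAAGCTTGAATTC"
toy_probe = reverse_complement(toy_target[6:21])

sites = probe_sites(toy_target, toy_probe)
print(len(toy_probe))  # 15
print(sites)           # [6]
```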

The fourth aspect of the present invention relates to an antisense DNA of the above-mentioned gene
regulating aureobasidin sensitivity, while the fifth aspect of the present invention relates to an antisense
RNA thereof. The introduction of the antisense DNA or antisense RNA into cells makes it possible to control
the expression of the gene regulating aureobasidin sensitivity.

As examples of the antisense DNA to be introduced, antisense DNAs corresponding to the genes
regulating aureobasidin sensitivity of SEQ ID Nos. 1, 3, 5, 7, 9, 13, 15 or 21 in Sequence Listing and
parts thereof may be cited. SEQ ID No. 17 in Sequence Listing shows an example of this antisense DNA. It
represents the sequence of an antisense DNA of the gene regulating aureobasidin sensitivity of SEQ ID No.
1 in Sequence Listing. A fragment obtained by appropriately cleaving some part of such an antisense DNA,
and a DNA synthesized on the basis of such an antisense DNA sequence, may be used as the antisense
DNA.

As examples of the antisense RNA to be introduced, antisense RNAs corresponding to the genes
regulating aureobasidin sensitivity of SEQ ID Nos. 1, 3, 5, 7, 9, 13, 15 or 21 in Sequence Listing and
parts thereof may be cited. SEQ ID No. 18 in Sequence Listing shows an example of this antisense RNA. It
represents the sequence of an antisense RNA of the gene regulating aureobasidin sensitivity of SEQ ID No.
1 in Sequence Listing. A fragment obtained by appropriately cleaving some part of such an antisense RNA,
an RNA synthesized on the basis of such an antisense RNA sequence, and an RNA prepared with RNA
polymerase in an in vitro transcription system by using the DNA corresponding to the gene regulating
aureobasidin sensitivity of SEQ ID No. 1 or SEQ ID No. 3 in Sequence Listing or a part thereof may be
used as the antisense RNA.
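
The relationship between a sense sequence and its antisense DNA or RNA can be sketched as follows. This is a minimal illustration assuming the antisense strand is the reverse complement of the sense (coding) strand. The toy sequence is hypothetical and unrelated to SEQ ID No. 17 or No. 18, and the function names are invented for this sketch.

```python
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_dna(sense_dna):
    """Reverse complement of the sense (coding) DNA strand."""
    return sense_dna.upper().translate(DNA_COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """Antisense RNA: the antisense DNA sequence with U in place of T."""
    return antisense_dna(sense_dna).replace("T", "U")


def sense_mrna(sense_dna):
    """mRNA with the same sequence as the sense strand (U for T)."""
    return sense_dna.upper().replace("T", "U")


# Hypothetical toy coding sequence.
toy_sense = "ATGGCTGGT"
print(antisense_dna(toy_sense))  # ACCAGCCAT
print(antisense_rna(toy_sense))  # ACCAGCCAU
print(sense_mrna(toy_sense))     # AUGGCUGGU
```

The antisense RNA printed here is base-for-base complementary to the mRNA when the two are paired in antiparallel orientation, which is the property that allows it to interfere with expression.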

The antisense DNA and antisense RNA may be chemically modified so as to prevent degradation in
vivo or to facilitate passage through a cell membrane. A substance capable of inactivating mRNA, for
example, a ribozyme, may be linked thereto. The antisense DNA and antisense RNA thus prepared are usable
in the treatment of various diseases such as mycoses accompanied by an increase in the amount of mRNA
coding for a protein regulating aureobasidin sensitivity.

The sixth aspect of the present invention relates to a recombinant plasmid having a gene coding for a
protein regulating aureobasidin sensitivity integrated into an appropriate vector. For example, a
plasmid in which a gene regulating aureobasidin sensitivity has been integrated into an appropriate
yeast vector is highly useful as a selection marker gene, since a transformant can be easily selected
thereby on the basis of its resistance to aureobasidin.

Also, the recombinant plasmid can be stably carried by, for example, Escherichia coli. Examples of
vectors which are usable in this case include pUC118, pWH5, pAU-PS, Traplex119 and pTV118. pAU-PS
having the spaur1S gene integrated therein is named pSPAR1. pWH5 having the scaur1S gene integrated
therein is named pSCAR1. pWH5 having the scaur2R gene integrated therein is named pSCAR2. Traplex119
vector having the caaur1 gene integrated therein is named pCAAR1. pTV118 vector having a part of the
caaur2 gene integrated therein is named pCAAR2N. Each of these recombinant plasmids is transformed
into E. coli. It is also possible to express these plasmids in an appropriate host. For this purpose, the
gene is reduced exclusively to the open reading frame (ORF) to be translated into a protein by cleaving
with an appropriate restriction enzyme, if necessary, and then ligated to an appropriate vector. Thus an
expression recombinant plasmid can be obtained. When E. coli is used as the host, plasmids such as
pTV118 may be used as the vector for the expression plasmid. When a yeast is used as the host, plasmids
such as pYES2 may be used as the vector. When mammalian cells are used as the host, plasmids such as
pMAMneo may be used as the vector.

The seventh aspect of the present invention relates to a transformant having the above-mentioned
recombinant plasmid which has been introduced into an appropriate host. As the host, E. coli, yeasts and
mammalian cells are usable. E. coli JM109 transformed by pSPAR1 having the spaur1S gene integrated
therein has been named and designated as Escherichia coli JM109/pSPAR1 and deposited at National
Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology (1-3, Higashi
1 chome Tsukuba-shi Ibaraki-ken 305, JAPAN), in accordance with the Budapest Treaty under the
accession number FERM BP-4485. E. coli HB101 transformed by pSCAR1 having the scaur1S gene
integrated therein has been named and designated as Escherichia coli HB101/pSCAR1 and deposited at
National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology in
accordance with the Budapest Treaty under the accession number FERM BP-4483. E. coli HB101
transformed by pSCAR2 having the scaur2R gene integrated therein has been named and designated as
Escherichia coli HB101/pSCAR2 and deposited at National Institute of Bioscience and Human-Technology,
Agency of Industrial Science and Technology in accordance with the Budapest Treaty under the accession
number FERM BP-4484. E. coli HB101 transformed by pCAAR1 having the caaur1S gene integrated therein
has been named and designated as Escherichia coli HB101/pCAAR1 and deposited at National Institute of
Bioscience and Human-Technology, Agency of Industrial Science and Technology in accordance with the
Budapest Treaty under the accession number FERM BP-4482. E. coli HB101 transformed by pCAAR2N
having a part of the caaur2 gene integrated therein has been named and designated as Escherichia coli
HB101/pCAAR2N and deposited at National Institute of Bioscience and Human-Technology, Agency of
Industrial Science and Technology in accordance with the Budapest Treaty under the accession number
FERM BP-4481.

A transformant capable of expressing a protein regulating aureobasidin sensitivity can be obtained by
transforming an appropriate host with an expression recombinant plasmid, as described above. For example,
a yeast having a recombinant plasmid as shown in Fig. 9 introduced thereinto is usable for this purpose.

The eighth aspect of the present invention relates to a process for producing a protein regulating 
aureobasidin sensitivity which comprises incubating a transformant according to the sixth aspect of the 
present invention, which contains a gene coding for this protein, in an appropriate nutritional medium, 
allowing the expression of the protein, then recovering the protein from the cells or the medium and 
purifying the same. For the expression of the gene coding for this protein, E. coli, a yeast or mammalian 
cells are employed as a host. When the yeast having the recombinant plasmid of Fig. 9 is incubated in a 
medium containing galactose, for example, the protein regulating aureobasidin sensitivity which is encoded 
by the scaur1S gene can be expressed. 

The ninth aspect of the present invention relates to an isolated protein regulating aureobasidin 
sensitivity. As examples of such a protein, those encoded by the above-mentioned spaur1, scaur1, scaur2, 
caaur1 and caaur2 genes can be cited. 

The spaur1S gene codes for a protein having an amino acid sequence represented by SEQ ID No. 4 in 
Sequence Listing, while the scaur1S gene codes for a protein having an amino acid sequence represented 
by SEQ ID No. 8 in Sequence Listing. By northern hybridization with the use of a DNA fragment of the 
spaur1 gene as a probe, mRNAs are detected from a sensitive strain (Fig. 10). Thus the expression of the 
spaur1 gene is confirmed. 

Fig. 10 is an autoradiogram showing the results of the northern hybridization wherein mRNAs obtained 
from cells of a sensitive strain of Schizo. pombe in the logarithmic growth phase (lane 1), cells of a resistant 
strain in the logarithmic growth phase (lane 2), cells of the sensitive strain in the stationary phase (lane 3) 
and cells of the resistant strain in the stationary phase (lane 4) are electrophoresed on a 1.2% agarose gel 
containing formaldehyde. 

The tenth aspect of the present invention relates to an antibody against the above-mentioned protein 
regulating aureobasidin sensitivity. For example, proteins having amino acid sequences of SEQ ID Nos. 2, 4, 
6, 8, 10, 14, 16 or 22 in Sequence Listing and peptides comprising some parts of these amino acid 
sequences may be used as an antigen. The former antigens can be prepared through the expression in a 
transformant followed by purification, while the latter antigens can be synthesized on, for example, a 
marketed synthesizer. The antibody is produced by the conventional method. For example, an animal such 
as a rabbit is immunized with the above-mentioned protein or a peptide fragment together with an adjuvant 
to thereby give a polyclonal antibody. A monoclonal antibody can be produced by fusing antibody-producing 
B cells, which have been obtained by immunizing with an antigen, with myeloma cells, screening 
hybridomas producing the target antibody, and incubating these cells. As will be described hereinafter, 
these antibodies are usable in the treatment and diagnosis of animal and human diseases in which the 
above-mentioned proteins participate, such as mycoses. 

For example, a peptide corresponding to the part of the 103- to 113-positions in the amino acid 
sequence of SEQ ID No. 8 is synthesized on a synthesizer and then bound to a carrier protein. Then a 
rabbit is immunized therewith and thus a polyclonal antibody is obtained. In the present invention, keyhole 
limpet hemocyanin (KLH) is used as the carrier protein. Alternatively, bovine serum albumin and ovalbumin 
are usable therefor. 

The eleventh aspect of the present invention relates to a process for detecting a protein regulating 
aureobasidin sensitivity by using the above-mentioned antibody. The detection can be carried out by 
detecting the binding of the antibody to the protein or measuring the amount of binding. For example, the 
protein or the cells producing the same can be detected by treating with a fluorescence-labeled antibody 
and then observing under a fluorescence microscope. The amount of the antibody bound to the protein can 
be measured by various known methods. For example, S. cerevisiae cells are stained by the 
immunofluorescent antibody technique by using the above-mentioned antibody and a secondary antibody 
such as FITC-labeled anti-rabbit antibody. Thus it is clarified that the protein encoded by the scaur1 gene is 
distributed all over the cells. Further, a yeast having the recombinant plasmid of Fig. 9 introduced thereinto 
is incubated in a medium containing glucose or galactose. The cells thus obtained are disrupted with glass 
beads and proteins are solubilized. Then these proteins are separated by SDS-polyacrylamide gel 
electrophoresis (SDS-PAGE) and the western blotting is carried out in the conventional manner by using the 
above-mentioned polyclonal antibody and peroxidase-labeled anti-rabbit antibody. Consequently, the protein 
encoded by the scaur1 gene can be detected, as Fig. 11 shows. 

Fig. 11 shows the results of the western blotting wherein the proteins prepared from the cells obtained 
by the incubation in the presence of glucose (lane 1) or galactose (lane 2) are subjected to SDS-PAGE. A 
main band binding to the polyclonal antibody of the present invention is detected at around 38 kDa. 

The twelfth aspect of the present invention relates to a process for detecting a gene regulating 
aureobasidin sensitivity, for example, mRNA at the expression of a protein, by using the above-mentioned 
oligonucleotide as a nucleic acid probe. This process is applicable to the diagnosis of various diseases, 
including mycoses, associated with an abnormal amount of mRNA coding for the protein. For example, 
nucleic acids are precipitated from disrupted cells and mRNA is hybridized with a radioisotope-labeled 
nucleic acid probe on a nitrocellulose membrane. The amount of binding can be measured by 
autoradiography (Fig. 10) or with a scintillation counter. 

The thirteenth aspect of the present invention relates to a process for efficient screening of a novel 
antimycotic by using the transformant of the seventh aspect of the present invention or the protein 
regulating aureobasidin sensitivity of the ninth aspect of the present invention. For example, a drug exerting 
its effect on the protein or the gene of the present invention can be efficiently found through a 
comparison of the activity on a transformant containing a sensitive gene with the activity on a transformant 
containing a resistant gene or a comparison between the activities on transformants differing in expression 
level from each other. Also, the screening can be efficiently carried out by measuring the affinity for the 
protein of the present invention, for example, the activity of inhibiting the binding of radiolabeled 
aureobasidin to the protein. 
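
The comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the activity measure (a minimum inhibitory concentration, MIC), the cut-off ratio, the compound names and all numerical values are hypothetical and do not come from the present specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Compound:
    name: str
    mic_sensitive: float  # hypothetical MIC (ug/ml) on the transformant carrying the sensitive gene
    mic_resistant: float  # hypothetical MIC (ug/ml) on the transformant carrying the resistant gene


def selectivity_ratio(compound: Compound) -> float:
    """Ratio of the MIC on the resistant transformant to the MIC on the sensitive one."""
    if compound.mic_sensitive <= 0:
        raise ValueError("MIC values must be positive")
    return compound.mic_resistant / compound.mic_sensitive


def flag_candidates(compounds, min_ratio=8.0):
    """Names of compounds whose activity depends strongly on the allele carried.

    min_ratio is an arbitrary illustrative cut-off, not a value from the source.
    """
    return [c.name for c in compounds if selectivity_ratio(c) >= min_ratio]
```

A compound that is equally active on both transformants gives a ratio near 1 and is not flagged, whereas a compound whose activity is lost on the resistant transformant is a candidate for acting on the protein or the gene in question.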

[Brief Description of the Drawings] 

[Fig. 1] 

Restriction enzyme map of the genes spaur1R and spaur1S regulating aureobasidin sensitivity. 

[Fig. 2] 

Restriction enzyme map of scaur1R and scaur1S. 

[Fig. 3] 

Restriction enzyme map of scaur2R and scaur2S. 

[Fig. 4] 

Structure of a DNA for disrupting the Schizo. pombe spaur1S gene. 

[Fig. 5] 

Structure of a DNA for disrupting the S. cerevisiae scaur1S gene. 

[Fig. 6] 

Results of the detection of the aur1 gene caaur1 carried by C. albicans by the PCR method. 

[Fig. 7] 

Restriction enzyme map of the caaur1 gene carried by C. albicans. 

[Fig. 8] 

Restriction enzyme map of the caaur2 gene. 

[Fig. 9] 

Structure of a plasmid YEpSCARW3 for expressing the scaur1 gene. 

[Fig. 10] 

Results of the northern hybridization of the spaur1 gene of Schizo. pombe. 

[Fig. 11] 

Results of the detection of the scaur1 protein by using an antibody. 

[Fig. 12] 

Restriction enzyme map of pAR25. 

[Examples] 

To further illustrate the present invention in greater detail, the following Examples will be given. 
However, it is to be understood that the present invention is not restricted thereto. 

Example 1: Cloning of a gene regulating aureobasidin sensitivity originating in fission yeast Schizo. pombe 

1-a) Separation of aureobasidin-resistant mutant of Schizo. pombe 

About 1 x 10^8 cells of a Schizo. pombe haploid cell strain JY745 (mating type h-, genotype ade6-M210, 
leu1, ura4-D18) exhibiting a sensitivity to aureobasidin at a concentration of 0.08 µg/ml were 
suspended in 1 ml of a phosphate buffer containing 0.9% NaCl. Then the cells were mutagenized with EMS 
at a final concentration of 3% at 30 °C for 90 minutes. After neutralizing by adding 8 ml of 5% sodium 
thiosulfate, the cells thus treated were harvested by centrifugation (2500 r.p.m., 5 minutes), washed twice 
with 6 ml of physiological saline and then suspended in 2 ml of a YEL medium (3% of glucose, 0.5% of 
yeast extract). The suspension was incubated at 30 °C for 5 hours under stirring and then spread on a YEA 
plate (the YEL medium containing 1.5% of agar) containing 5 µg/ml of aureobasidin A. After incubating at 
30 °C for 3 to 4 days, two or three aureobasidin-resistant colonies were formed per 1 x 10^8 cells. After 
carrying out the mutagenesis several times, five clone mutants, i.e., THR01, THR04, THR05, THR06 and 
THR07 were obtained. These mutants were resistant to more than 25 µg/ml of aureobasidin A but the same 
as the parent strain in the sensitivity to cycloheximide and amphotericin B. Therefore it is estimated that 
they are not mutants having a multiple drug resistance but ones having a resistance specific to 
aureobasidin. 

1-b) Genetic analysis 

Each of the above-mentioned resistant strains THR01, THR04, THR05, THR06 and THR07 was crossed 
with normal cells of Schizo. pombe LH121 strain (mating type h+, genotype ade6-M216, ura4-D18) differing 
in mating type. The diploid cells obtained were examined for resistance to aureobasidin. Similar to the 
resistant strains, the five diploids formed by crossing the resistant strains with the normal one were resistant 
to 25 µg/ml of aureobasidin A, thus proving that these resistant mutations were dominant. To perform the 
tetrad analysis, the above-mentioned diploids were subsequently inoculated on an MEA medium (3% of 
malt extract, 2.5% of agar) for sporulation and incubated at 25 °C for 2 days. Prior to the meiosis, the 
diploid cells replicated DNA on the MEA medium and then underwent the meiosis to form asci each 
containing four haploid ascospores. These spores were separated with a micromanipulator and 
allowed to germinate on the YEA plate, followed by the formation of colonies. Then the resistance to 
aureobasidin of these colonies was examined. Among the four spores contained in an ascus, sensitivity 
and resistance segregated 2 : 2. This result indicates that the aureobasidin resistance 
was caused by a mutation in a single gene. Further, the complementation test was performed in 
order to confirm whether the resistant genes of the above-mentioned five mutants were identical with each 
other or not. For example, a mutant of the mating type h+, which had been obtained by crossing the mutant 
THR01 with the LH121 strain in the above tetrad analysis, was crossed with another mutant THR04 (mating 
type h-) on the MEA plate as described above and, after sporulation, the tetrad analysis was carried out. As 
a result, all of the colonies formed from the four ascospores showed resistance to aureobasidin, which indicates 
that the mutated genes of THR01 and THR04 are the same. Similarly, the five mutants 
were examined and it was thus found out that all mutations occurred on the same gene. This gene 
regulating aureobasidin sensitivity is named spaur1, the normal gene (sensitive gene) is named spaur1S and 
the mutated gene (resistant gene) is named spaur1R. 
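
The reasoning behind the tetrad analysis and the cross between two resistant mutants can be illustrated with a short Python sketch. It is a simplified model, not part of the experimental procedure: genotypes are written as tuples of alleles ("R" for a resistant allele, "S" for a sensitive one), a spore is taken to be resistant if it carries "R" at any locus, and two loci, when present, are assumed to be unlinked. All function names are hypothetical.

```python
def is_resistant(spore):
    """A haploid spore is resistant if it carries a resistant allele at any locus."""
    return "R" in spore


def possible_tetrads(parent1, parent2):
    """Tetrad classes expected from a diploid formed by two haploid parents.

    Each parent is a tuple of alleles, one entry per locus (1 or 2 loci).
    """
    if len(parent1) != len(parent2) or len(parent1) not in (1, 2):
        raise ValueError("expected two genotypes with the same 1 or 2 loci")
    if len(parent1) == 1:
        # Each parental allele is replicated once and segregates 2 : 2.
        return {"single locus": [parent1, parent1, parent2, parent2]}
    rec1 = (parent1[0], parent2[1])  # recombinant spore types
    rec2 = (parent2[0], parent1[1])
    return {
        "parental ditype": [parent1, parent1, parent2, parent2],
        "nonparental ditype": [rec1, rec1, rec2, rec2],
        "tetratype": [parent1, parent2, rec1, rec2],
    }


def segregation_ratios(parent1, parent2):
    """(resistant, sensitive) spore counts for every possible tetrad class."""
    ratios = {}
    for name, tetrad in possible_tetrads(parent1, parent2).items():
        resistant = sum(is_resistant(spore) for spore in tetrad)
        ratios[name] = (resistant, len(tetrad) - resistant)
    return ratios
```

In this model a resistant mutant crossed with a sensitive strain, `segregation_ratios(("R",), ("S",))`, gives only 2 : 2 tetrads, as expected for a single gene. Two mutants altered in the same gene, `segregation_ratios(("R",), ("R",))`, give only fully resistant tetrads, whereas mutations in two different unlinked genes, `segregation_ratios(("R", "S"), ("S", "R"))`, would also produce tetrads containing sensitive spores.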

1-c) Preparation of genomic library of aureobasidin resistant strain 

Genomic DNA was extracted and purified from the aureobasidin resistant strain THR01 by the method 
of P. Philippsen et al. [Methods in Enzymology, 194, 169 - 175 (1991)]. The purified genomic DNA (8 µg) 
was partially digested by treating with 5 U of a restriction enzyme HindIII at 37 °C for 10 minutes, 
deproteinized with phenol/chloroform and precipitated with ethanol. The partially digested DNA was 
electrophoresed on a 0.8% agarose gel and DNA in the region of 3 to 15 kb was extracted. The DNA thus 
obtained was ligated with a yeast-E. coli shuttle vector pAU-PS (2 µg) which had been completely digested 
with HindIII by using a DNA ligation kit (manufactured by Takara Shuzo Co., Ltd.) and then transformed into 
E. coli HB101. Thus a genomic library of the aureobasidin resistant strain was formed. E. coli containing this 
genomic library was incubated in 50 ml of an LB medium (1% of bacto tryptone, 0.5% of bacto yeast extract, 
0.5% of sodium chloride) containing 100 µg/ml of ampicillin and 25 µg/ml of tetracycline at 37 °C overnight. 
Then a plasmid was recovered and purified from the E. coli cells. 
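
How many independent clones a library of this kind needs can be estimated with the Clarke-Carbon relation, N = ln(1 - P) / ln(1 - f), where f is the ratio of insert size to genome size and P is the desired probability that a given sequence is represented. The Python sketch below is an illustration only. The insert sizes of 3 to 15 kb are taken from the text above, whereas the genome size of about 13.8 Mb for Schizo. pombe is an outside assumption and is not stated in this specification; the function names are hypothetical.

```python
import math


def clones_needed(genome_bp: float, insert_bp: float, probability: float) -> int:
    """Clarke-Carbon estimate of the number of clones for a given coverage probability."""
    if not 0 < probability < 1:
        raise ValueError("probability must lie strictly between 0 and 1")
    fraction = insert_bp / genome_bp
    if not 0 < fraction < 1:
        raise ValueError("insert must be smaller than the genome")
    return math.ceil(math.log(1 - probability) / math.log(1 - fraction))


def coverage_probability(genome_bp: float, insert_bp: float, n_clones: int) -> float:
    """Probability that a given sequence is present among n_clones independent clones."""
    fraction = insert_bp / genome_bp
    return 1 - (1 - fraction) ** n_clones


GENOME_BP = 13.8e6  # assumed Schizo. pombe genome size; not a value from the source

if __name__ == "__main__":
    for insert_bp in (3_000, 15_000):  # insert size range selected on the gel
        print(insert_bp, clones_needed(GENOME_BP, insert_bp, 0.99))
```

Under these assumptions, even the smallest inserts call for a number of clones on the order of 10^4 for 99% coverage, and larger inserts require fewer.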

1-d) Expression and cloning of aureobasidin resistant gene spaur1R 

The plasmid originating in the genomic library of the aureobasidin resistant strain as prepared above 
was transformed into a strain Schizo. pombe JY745 by the method of Okazaki et al. [Nucleic Acids 
Research, 18, 6485 - 6489 (1990)]. The transformed cells were spread on a minimum medium SD plate 
[0.67% of yeast nitrogen base without amino acids (manufactured by Difco), 2% of glucose, 2% of agar] 
containing 75 µg/ml of adenine sulfate and 50 µg/ml of leucine. After incubating at 30 °C for 3 to 4 days, 
the colonies thus formed were replicated onto an SD plate containing 5 µg/ml of aureobasidin A, 75 µg/ml 
of adenine sulfate and 50 µg/ml of leucine. It is conceivable that a colony propagated on this plate may 
have the plasmid containing the aureobasidin resistant gene. This colony was inoculated into 5 ml of a 
liquid SD medium containing 75 µg/ml of adenine sulfate and 50 µg/ml of leucine. After incubating at 30 °C 
for 2 days, the plasmid was recovered from the propagated cells by the method of I. Hagan et al. [J. Cell 
Sci., 91, 587 - 595 (1988)]. Namely, the cells were harvested from the culture (5 ml) by centrifugation and 
then suspended in 1.5 ml of 50 mM citrate/phosphate buffer containing 1.2 M of sorbitol and 2 mg/ml of 
Zymolyase. Then the suspension was maintained at 37 °C for 60 minutes. The cells were collected by 
centrifuging at 3,000 r.p.m. for 30 seconds and suspended in 300 µl of a TE [10 mM of Tris-HCl, pH 8, 1 
mM of EDTA] solution. After adding 35 µl of 10% SDS, the mixture was maintained at 65 °C for 5 minutes. 
After adding 100 µl of 5 M potassium acetate, the mixture was allowed to stand in ice for 30 minutes. Then 
it was centrifuged at 10,000 r.p.m. at 4 °C for 10 minutes and a plasmid DNA was purified from the 
supernatant by using EASYTRAP™ (manufactured by Takara Shuzo Co., Ltd.). 

This plasmid was transformed into E. coli HB101 and a plasmid DNA was prepared from E. coli 
colonies formed on an LB medium containing ampicillin and tetracycline. This plasmid, which contained a 
DNA of 4.5 kb, was named pAR25. Fig. 12 shows the restriction enzyme map of the DNA of 4.5 kb in 
pAR25. To specify the gene region, HindIII fragments or SacI fragments of various sizes were subcloned 
into the pAU-PS vector. These DNAs were transformed into normal JY745 cells by the above-mentioned 
method of Okazaki et al. and the acquisition of aureobasidin resistance was examined. As a result, it was 
revealed that a HindIII-SacI 2.4 kb DNA fragment contains the spaur1R gene. The restriction enzyme map of 
this DNA segment containing the aureobasidin resistant gene spaur1R is shown in Fig. 1. This fragment was 
cloned into a pUC118 vector (named pUARS2R) and then the DNA nucleotide sequence was identified 
(SEQ ID No. 1 in Sequence Listing). From this nucleotide sequence, it is revealed that the spaur1R gene 
codes for a protein having an amino acid sequence represented by SEQ ID No. 2 in Sequence Listing. 

1-e) Cloning of aureobasidin sensitive gene spaur1S 

By the same method as the one employed in the above 1-c), genomic DNA was extracted and purified 
from normal cells. After partially digesting with HindIII, a genomic library of the normal cells was 
constructed. An E. coli stock containing this library DNA was spread on an LB agar medium containing 
ampicillin and tetracycline and incubated overnight at 37 °C. The colonies thus formed were transferred 
onto a nylon membrane (Hybond™-N, manufactured by Amersham) and the colony hybridization was 
performed. 

As a probe, the above-mentioned DNA fragment (2.4 kb) obtained by cleaving the spaur1R gene with 
HindIII-SacI and labeled with [α-32P]dCTP by using a random primer DNA labeling kit (manufactured by 
Takara Shuzo Co., Ltd.) was used. As the result of screening of 5 x 10^4 colonies, five clones 
hybridizable with the probe were obtained. Plasmids were purified from E. coli cells of these five clones. As 
the result of the cleavage with restriction enzymes, it was found out that all of these clones contained the 
same DNA fragment of 4.5 kb (named pARN1). The restriction enzyme map of the DNA of 4.5 kb in pARN1 
was identical with that of pAR25 shown in Fig. 12. Therefore, a HindIII-SacI 2.4 kb DNA fragment which was 
a region containing the spaur1S gene was prepared from pARN1. Then it was cloned into the pAU-PS 
vector and this plasmid was named pSPAR1. 

By using this plasmid pSPAR1, a strain E. coli JM109 was transformed and the transformant thus 
obtained was named and designated as Escherichia coli JM109/pSPAR1. It has been deposited at National 
Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology in accordance 
with the Budapest Treaty under the accession number FERM BP-4485. This DNA fragment containing the 
aureobasidin sensitive gene spaur1S had the restriction enzyme map shown in Fig. 1 and the DNA 
nucleotide sequence thereof was the one represented by SEQ ID No. 3 in Sequence Listing. Based on this 
nucleotide sequence, it has been revealed that the spaur1S gene codes for a protein having the amino acid 
sequence represented by SEQ ID No. 4 in Sequence Listing and, when compared with the resistant gene 
spaur1R, the amino acid at the residue 240 has been changed from glycine into cysteine. 
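
A comparison of this kind, between the proteins encoded by a sensitive and a resistant allele, amounts to listing the positions at which two sequences of equal length differ. The Python sketch below shows the idea on short toy sequences in one-letter code; these sequences are invented for illustration and are not the sequences of SEQ ID Nos. 2 and 4, and the function name is hypothetical.

```python
def substitutions(protein_a: str, protein_b: str):
    """List (position, residue_a, residue_b) for every differing residue.

    Positions are 1-based, as in a sequence listing. The two sequences must
    have the same length, i.e. only substitutions are considered.
    """
    if len(protein_a) != len(protein_b):
        raise ValueError("sequences must have the same length")
    return [
        (position, a, b)
        for position, (a, b) in enumerate(zip(protein_a, protein_b), start=1)
        if a != b
    ]


if __name__ == "__main__":
    # Toy sequences differing by glycine (G) versus cysteine (C) at position 3.
    print(substitutions("MSGLK", "MSCLK"))
```

Applied to the two full-length sequences, such a comparison would be expected to report the single difference at residue 240 described above.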

Example 2: Cloning of aureobasidin sensitive genes scaur1 and scaur2 originating in budding yeast S. 
cerevisiae 

2-a) Separation of aureobasidin resistant mutant of S. cerevisiae 

A strain S. cerevisiae DKD5D (mating type a, genotype leu2-3,112, trp1, his3) having a sensitivity to 
aureobasidin at a concentration of 0.31 µg/ml was mutagenized with EMS in the same manner as the one 
employed in the case of Schizo. pombe. Then resistant mutants were separated on an agar plate of a 
complete nutritional medium YPD (1% of yeast extract, 2% of polypeptone, 2% of glucose) containing 5 
µg/ml or 1.5 µg/ml of aureobasidin A. After repeating the mutagenesis several times, 34 mutant clones were 
obtained. These mutants were resistant to more than 25 µg/ml of aureobasidin A and estimated as having 
not a multiple drug resistance mutation but an aureobasidin-specific resistance mutation. 

2-b) Genetic analysis 

Similar to the above-mentioned case of Schizo. pombe, the genetic analysis using the tetrad analysis 
and the complementation test was performed. As a result, the genes could be classified into two types. 
These genes regulating aureobasidin sensitivity were named scaur1 and scaur2, the resistant genes 
isolated from the resistant mutant were named scaur1R and scaur2R, and the sensitive genes isolated from 
the sensitive wild-type strain were named scaur1S and scaur2S, respectively. 

The R94A strain had a gene with dominant mutation (scaur1R). It has been further clarified that the 
scaur1 gene is located in the neighborhood of the met14 gene of the eleventh chromosome. 

2-c) Preparation of genomic library of aureobasidin resistant strain having aureobasidin resistant gene 
scaur1R 

Genomic DNA was extracted and purified from the aureobasidin resistant strain R94A by the 
above-mentioned method of P. Philippsen et al. The purified genomic DNA (8 µg) was partially digested by 
treating with 5 U of a restriction enzyme HindIII at 37 °C for 10 minutes, deproteinized with 
phenol/chloroform and precipitated with ethanol. The partially digested DNA thus obtained was electrophoresed 
on a 0.8% agarose gel and DNA in the region of 3 to 15 kb was extracted. The DNA thus obtained was 
ligated with a yeast-E. coli shuttle vector pWH5 (2 µg) which had been completely digested with HindIII by 
using a DNA ligation kit and then transformed into E. coli HB101. Thus a genomic library was formed. E. 
coli containing this genomic library was cultured in 50 ml of an LB medium containing ampicillin and 
tetracycline at 37 °C overnight. Then a plasmid was recovered and purified from the E. coli cells. 

2-d) Expression and cloning of aureobasidin resistant gene scaur1R 

The above-mentioned genomic library of the R94A strain was transformed into S. cerevisiae SH3328 
(mating type a, genotype ura3-52, his4, thr4, leu2-3,112) in accordance with the method of R.H. Schiestl 
et al. [Current Genetics, 16, 339 - 346 (1989)]. The transformed cells were spread on a minimum medium 
SD plate [0.67% of yeast nitrogen base without amino acids, 2% of glucose, 2% of agar] containing 25 
µg/ml of uracil, 35 µg/ml of histidine and 500 µg/ml of threonine. After incubating at 30 °C for 3 to 4 days, 
the colonies thus formed were replicated onto a YPD agar plate containing 1.5 µg/ml of aureobasidin A. A 
colony thus formed was inoculated into 5 ml of a liquid YPD medium. After incubating at 30 °C for 2 days, a 
plasmid DNA was recovered from the propagated cells by the above-mentioned method of I. Hagan et al. 
This plasmid was transformed into a yeast again and it was confirmed that the obtained transformant had 
acquired aureobasidin resistance. This plasmid, which contained a DNA of 3.5 kb, was named pWTCR3. 
Neither the DNA fragment of 2.0 kb nor the DNA fragment of 1.5 kb obtained by cleaving with HindIII 
exhibited any aureobasidin resistant activity alone. Thus it is confirmed that the gene is contained in the 
DNA fragment of 3.5 kb. Fig. 2 shows the restriction enzyme map of this DNA fragment of 3.5 kb containing 
the aureobasidin resistant gene scaur1R. The HindIII fragments of 1.5 kb and 2 kb were each cloned into 
pUC118, followed by the determination of the DNA nucleotide sequence (SEQ ID No. 5 in Sequence 
Listing). From this nucleotide sequence, it has been revealed that the scaur1R gene codes for a protein 
having an amino acid sequence represented by SEQ ID No. 6 in Sequence Listing. 

2-e) Cloning of aureobasidin sensitive gene scaur1S corresponding to aureobasidin resistant gene scaur1R 

By the same method as the one employed in the above Example 2-c), genomic DNA was extracted and 
purified from the parent strain S. cerevisiae DKD5D. After partially digesting with HindIII, the DNA was 
ligated with pWH5 and transformed into E. coli HB101. Thus a genomic library of the normal cells was 
formed. An E. coli stock containing this library DNA was spread on an LB agar medium containing ampicillin 
and tetracycline and incubated overnight at 37 °C. The colonies thus formed were transferred onto a nylon 
membrane (Hybond™-N) and the colony hybridization was carried out. As a probe, the DNA fragment of 
3.5 kb obtained in the above Example 2-d) and labeled with [α-32P]dCTP by using a random primer DNA 
labeling kit (manufactured by Takara Shuzo Co., Ltd.) was used. As the result of screening of 2 x 10^4 
colonies, seven clones hybridizable with the probe were obtained. Plasmids were purified from E. coli 
cells of these clones. As the result of the cleavage with restriction enzymes, one of these clones contained 
a DNA fragment of 3.5 kb. This DNA fragment had the restriction enzyme map of Fig. 2 and was thus judged as 
containing the scaur1S gene. The plasmid containing this DNA fragment was named pSCAR1, while E. coli 
HB101 having this plasmid introduced therein was named and designated as Escherichia coli 
HB101/pSCAR1. This strain has been deposited at National Institute of Bioscience and Human-Technology, 
Agency of Industrial Science and Technology in accordance with the Budapest Treaty under the accession 
number FERM BP-4483. The DNA fragment of 3.5 kb obtained by partially digesting pSCAR1 with HindIII 
was subcloned into pUC118 and the nucleotide sequence thereof was determined (SEQ ID No. 7 in 
Sequence Listing). A comparison with the resistant gene indicates that the base at the position 852 has 
been changed from T into A and, due to this replacement, the amino acid has been converted from 
phenylalanine into tyrosine (SEQ ID No. 8 in Sequence Listing). 
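
A single T/A difference of this kind is enough to exchange phenylalanine and tyrosine, because in the standard genetic code the phenylalanine codons (TTT, TTC) and the tyrosine codons (TAT, TAC) differ only in their second base. The Python sketch below illustrates this with a deliberately partial codon table. Which of the two phenylalanine codons actually occurs at this position is not stated above, so the codons used are assumptions, and the function names are hypothetical.

```python
# Partial standard genetic code: only the codons needed for this illustration.
CODON_TABLE = {
    "TTT": "Phe",
    "TTC": "Phe",
    "TAT": "Tyr",
    "TAC": "Tyr",
}


def substitute_base(codon: str, index: int, new_base: str) -> str:
    """Return the codon with the base at the 0-based index replaced."""
    if not 0 <= index < len(codon):
        raise IndexError("index outside the codon")
    return codon[:index] + new_base + codon[index + 1:]


def translate(codon: str) -> str:
    """Translate one codon with the partial table above."""
    return CODON_TABLE[codon]


if __name__ == "__main__":
    for codon in ("TTT", "TTC"):  # assumed codons, for illustration only
        mutated = substitute_base(codon, 1, "A")
        print(codon, translate(codon), "->", mutated, translate(mutated))
```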

2-f) Preparation of genomic library of aureobasidin resistant strain having aureobasidin resistant gene 
scaur2R 

A genomic library was prepared from an aureobasidin resistant strain L22-8B by the same method as 
the one described in Example 2-c). E. coli containing this genomic library was cultured in an LB medium 
(50 ml) containing ampicillin and tetracycline at 37 °C overnight. Then plasmids were recovered and purified 
from the E. coli cells. 

2-g) Expression and cloning of aureobasidin resistant gene scaur2R 

The above-mentioned plasmids originating in the genomic library of the L22-8B strain were transformed 
into S. cerevisiae SH3328 by the above-mentioned method of R.H. Schiestl. From the transformed strains, 
an aureobasidin resistant strain was isolated. Then a plasmid DNA containing the scaur2R gene was 
recovered from this transformant by the above-mentioned method of I. Hagan et al. This plasmid was 
transformed into a yeast again and it was confirmed that the transformant had acquired aureobasidin 
resistance. This plasmid, which contained a DNA of 8.5 kb, was named pSCAR2. Fig. 3 shows the 
restriction enzyme map of the DNA fragment of 8.5 kb containing this aureobasidin resistant gene scaur2R. 
E. coli HB101 having this plasmid pSCAR2 introduced therein was named and designated as Escherichia 
coli HB101/pSCAR2. This strain has been deposited at National Institute of Bioscience and 
Human-Technology, Agency of Industrial Science and Technology in accordance with the Budapest Treaty under 
the accession number FERM BP-4484. By using BamHI, EcoRI, HindIII and PstI, DNA fragments of various 
sizes were prepared and cloned into the pWH5 vector. These plasmids were transformed into S. cerevisiae 
DKD5D in accordance with the above-mentioned method of R.H. Schiestl et al. Then it was examined 
whether these transformants had acquired aureobasidin resistance or not. As a result, none of the 
transformants of the DNA fragments was a resistant one. Thus it has been clarified that the DNA fragment 
of the full length is necessary for the expression of the aureobasidin resistance. 

2-h) Isolation of aureobasidin sensitive gene scaur2S corresponding to aureobasidin resistant gene scaur2R 

An E. coli stock containing the genomic library of Example 2-e) prepared from normal cells of S. 
cerevisiae DKD5D was spread on an LB agar medium containing ampicillin and tetracycline and 
incubated at 37 °C overnight. The colonies thus formed were transferred onto a nylon membrane 
(Hybond™-N) and the colony hybridization was performed. As a probe, the DNA fragment of 8.5 kb 
obtained in the above Example 2-g) and labeled with [α-32P]dCTP by using a random primer DNA labeling 
kit was used. As the result of screening of 2 x 10^4 colonies, several clones hybridizable with the 
probe were obtained. Some of these clones contained a DNA fragment of 4.6 kb while others contained a 
DNA fragment of 3.9 kb. From the restriction enzyme maps of these DNA fragments, it was found out that 
these DNA fragments were each a part of the scaur2S gene shown in Fig. 3. These DNA fragments were 
ligated together to thereby give the scaur2S fragment shown in Fig. 3. The DNA fragment of 8.5 kb thus 
obtained was subcloned into pUC118 and then the DNA nucleotide sequence was determined (SEQ ID No. 
9 in Sequence Listing). Based on the nucleotide sequence of SEQ ID No. 9 in Sequence Listing, the amino 
acid sequence represented by SEQ ID No. 10 in Sequence Listing was estimated. 

Example 3: Gene disruption test on spaur1S and scaur1S genes 

3-a) Gene disruption test on spaur1S gene 

In order to examine by the gene disruption test whether the aureobasidin sensitive gene spaur1S is 
necessary for the cell growth, the plasmid pUARS2R prepared in Example 1-d) was first cleaved with BalI and 
EcoT22I. After eliminating a DNA fragment of 240 bp, the residual DNA fragment was blunted by using a 
DNA blunting kit (manufactured by Takara Shuzo Co., Ltd.). Then this DNA was ligated with a DNA 
containing ura4+ gene of 1.7 kb, which had been obtained by excising from a pUC8ura4 plasmid [Mol. Gen. 
Genet., 215, 81 - 86 (1988)] by cleaving with HindIII and blunting, to thereby give a plasmid 
pUARS2RBT22::ura4-1 and another plasmid pUARS2RBT22::ura4-6 in which the ura4 DNA had been 
inserted in the opposite direction. Both of these disrupted genes were excised from the vector pUC118 by 
cleaving with SacI and HindIII and ARS2RBT22::ura4-1 and ARS2RBT22::ura4-6 (Fig. 4), which were 
spaur1S DNA fragments containing ura4+, were purified. The purified DNA fragments were transformed into 
diploid cells Schizo. pombe C525 (h90/h90, ura4-D18/ura4-D18, leu1/leu1, ade6-M210/ade6-M216) by the 
above-mentioned method of Okazaki et al. and then a transformant was screened on an SD agar plate 
containing leucine. In the transformant thus obtained, one of a pair of spaur1S genes on the chromosome 
had been replaced by the disrupted gene ARS2RBT22::ura4-1 or ARS2RBT22::ura4-6 introduced thereinto. 
These cells were allowed to undergo sporulation on a sporulation medium MEA and subjected to the tetrad 
analysis. As a result, it was found out that two of the four ascospores formed colonies but the residual two 
spores formed no colony. That is to say, the spores in which the normal spaur1S 
gene had been replaced by the disrupted gene ARS2RBT22::ura4-1 were not propagated. It has been thus revealed that the 
spaur1S gene is essentially required for the growth of the cells. 

3-b) Gene disruption test on scaur1 s gene 

The plasmid pSCAR1 prepared in Example 2-e) was partially digested with HindIII to thereby give a
DNA fragment of 3.5 kb shown in Fig. 2. This DNA fragment was cloned into the HindIII site of pUC119 and
the obtained product was named pSCAR3. The obtained pSCAR3 was cleaved with StuI and EcoT22I.
After eliminating a DNA fragment of 0.3 kb, the obtained DNA was ligated with a DNA fragment (1.1 kb) of
the URA3 gene which had been obtained by cleaving a plasmid pYEUra3 (manufactured by Clontech
Laboratories, Inc.) with HindIII and EcoRI and blunting. Thus a plasmid pUSCAR3.ST22::URA3+ and another
plasmid pUSCAR3.ST22::URA3A, in which the URA3 gene had been inserted in the opposite direction,
were obtained. These disrupted genes were excised at the EcoRI site in the scaur1 s gene and the EcoRI site
in the pUC119 vector by cleaving with EcoRI. The scaur1 s DNA fragments containing URA3,
SCAR3.ST22::URA3+ and SCAR3.ST22::URA3A (Fig. 5), were purified. The purified DNA fragments were
transformed into diploid cells of S. cerevisiae AOD1 (mating type a/α, genotype ura3-52/ura3-52, leu2-3,
112/leu2-3,112, trp1/TRP1, thr4/THR4, his4/HIS4) by the above-mentioned method of R.H. Schiestl and
transformants were screened on an SD agar plate containing leucine. The transformants thus obtained were
allowed to undergo sporulation on a sporulation medium SP (1% of potassium acetate, 2% of agar) and
subjected to tetrad analysis. As a result, it was found that two of the four ascospores underwent
germination and formed colonies but the remaining two spores did not form colonies. That is to
say, the spores in which the scaur1 s gene had been replaced by the disrupted gene did not
propagate. It has thus been revealed that the scaur1 s gene is essential for the growth of the
cells.
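
The reasoning behind the tetrad analysis in Examples 3-a) and 3-b) can be sketched as follows. This is only an illustrative model, not part of the experimental procedure: it assumes a diploid carrying one intact and one disrupted copy of the gene, regular 2:2 segregation of the two alleles in meiosis, and that a spore grows only if it inherits the intact copy. The function and allele names are hypothetical.

```python
from collections import Counter

def tetrad(diploid):
    """Return the four spore genotypes produced by one meiosis.

    `diploid` is a pair of alleles. Each homolog is replicated once and the
    four chromatids are distributed to four spores, giving 2:2 segregation.
    """
    first, second = diploid
    return [first, first, second, second]

def is_viable(allele):
    # Assumption: the gene is essential, so only the intact allele supports growth.
    return allele == "intact"

# Heterozygous disruptant: one intact copy, one copy replaced by the marker insertion.
spores = tetrad(("intact", "disrupted"))
counts = Counter(is_viable(s) for s in spores)
print(f"{counts[True]} viable : {counts[False]} inviable")
```

Under these assumptions every tetrad gives two colony-forming and two non-growing spores, which is the pattern reported above. A non-essential gene would instead be expected to give four viable spores.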

Example 4: Examination on the expression of aureobasidin sensitive gene spaur1 by northern hybridization

From a normal strain or a resistant strain of Schizo. pombe, the whole RNAs were extracted and
purified by the method of R. Jensen et al. [Proc. Natl. Acad. Sci. USA, 80, 3035-3039 (1983)]. Further,
poly(A)+ RNA was purified by using Oligotex™-dT30 (manufactured by Takara Shuzo Co., Ltd.). The purified
poly(A)+ RNA (2.5 ng) was separated by electrophoresis on a 1.2% agarose gel containing formaldehyde
and transferred onto a nylon membrane (Hybond™-N). After immobilization, hybridization was performed
with the use of a HindIII-SacI fragment (2 kb) of the spaur1 R gene labeled with [α-32P]dCTP as a probe. As
a result, both the normal cells and the resistant cells showed a band of about 2 kb in the same amount.
In both cases, this amount did not change between the logarithmic growth phase and the stationary phase
(Fig. 10). Fig. 10 is an autoradiogram showing the results of the northern hybridization wherein mRNAs
obtained from cells of a sensitive strain of Schizo. pombe in the logarithmic growth phase (lane 1), cells of a
resistant strain in the logarithmic growth phase (lane 2), cells of the sensitive strain in the stationary phase
(lane 3) and cells of the resistant strain in the stationary phase (lane 4) are electrophoresed on a 1.2%
agarose gel containing formaldehyde.

Example 5: Determination of the activity of scaur1 s gene

5-a) Construction of plasmid YEpSCARW3 (Fig. 9) and YEpSCARW1

The plasmid pSCAR1 prepared in Example 2-e) was cleaved with HindIII and a fragment of 2 kb
containing the whole ORF was excised. This fragment was inserted into the HindIII site of an expression
plasmid YEp52 having a promoter Gal10, the expression of which was induced by galactose in a medium.
The plasmid having the scaur1 s gene which had been inserted in such a direction as to be normally
transcribed by the promoter Gal10 was named YEpSCARW3. Fig. 9 shows the structure of this plasmid.
Further, the plasmid having the scaur1 s gene inserted in the opposite direction was named YEpSCARW1.

5-b) Transformation by plasmids YEpSCARW3 and YEpSCARW1

By using 5 µg portions of the plasmids YEpSCARW3 and YEpSCARW1, the diploid S. cerevisiae cells
with the disrupted scaur1 s gene prepared in Example 3-b) were transformed. Then transformants were
screened on an SD agar plate. These transformants were allowed to undergo sporulation on an SP medium
and then subjected to tetrad analysis. When the expression of the scaur1 s gene was induced by using a
YPGal medium (1% of yeast extract, 2% of polypeptone, 2% of galactose), the ascospores formed from the
diploid cells transformed by YEpSCARW3 all underwent germination, whereas only two of the four ascospores
formed from the diploid cells transformed by YEpSCARW1 underwent germination and the remaining
two did not. It is thus conceivable that the cells with the disrupted scaur1 s gene have reverted to the normal state
by introducing YEpSCARW3 containing the scaur1 s gene into these cells. Accordingly, the use of these
cells with the disrupted scaur1 s gene as a host makes it possible to determine the activity of normal
aur1-analogous genes carried by other organisms.

Example 6: Confirmation and cloning of aur1 and aur2 genes (caaur1, caaur2) carried by C. albicans

6-a) Detection of aur1 gene by the PCR method

Poly(A)+ RNA was extracted and purified from an aureobasidin sensitive strain C. albicans TIMM0136 by
the same method as the one employed in Example 4. By using the poly(A)+ RNA (5 µg) as a template, a
double-stranded cDNA was synthesized on a cDNA synthesizing system Plus (manufactured by Amersham)
with the use of an oligo(dT) primer. Mixed primers for PCR corresponding to amino acid sequence regions
common to the amino acid sequences of S. cerevisiae and Schizo. pombe were synthesized on a
DNA synthesizer and purified. That is to say, a primer of SEQ ID No. 11 in Sequence Listing corresponding
to the region of amino acids at the 184- to 192-positions of SEQ ID No. 4 in Sequence Listing of Schizo.
pombe (from the 184- to 192-positions of SEQ ID No. 8 in Sequence Listing of S. cerevisiae) and another
primer of SEQ ID No. 12 in Sequence Listing corresponding to the region of amino acids from the 289- to
298-positions of Schizo. pombe (from the 289- to 298-positions of SEQ ID No. 8 in Sequence Listing of S.
cerevisiae) were employed.

PCR was performed by using these primers and the above-mentioned cDNA as a template by repeating
25 times a cycle comprising treatment at 94°C for 30 seconds, at 48°C for 1 minute and at 72°C for 2
minutes. As a result, a DNA (about 350 bp) of almost the same length as those of S. cerevisiae and Schizo.
pombe was amplified (Fig. 6). Fig. 6 shows a pattern obtained by carrying out PCR with the use of
cDNA of C. albicans (lane 1), cDNA of S. cerevisiae (lane 2) and cDNA of Schizo. pombe (lane 3) as a
template, electrophoresing each PCR product on an agarose gel and staining with ethidium bromide.

6-b) Cloning of aur1 gene (caaur1) of C. albicans

(i) Genomic DNA was extracted and purified from a strain C. albicans TIMM0136 by the same method as
the one described in Example 1-c). After partially digesting with HindIII, the DNA fragment was ligated
with a Traplex119 vector which had been completely digested with HindIII and transformed into E. coli
HB101. Thus a genomic library of C. albicans was prepared. From this library, a DNA fragment of 4.5 kb
containing the aur1 gene of C. albicans was cloned by using the DNA fragment of C. albicans obtained
by the PCR described in Example 6-a), which had been labeled with [α-32P]dCTP by using a random
primer DNA labeling kit (manufactured by Takara Shuzo Co., Ltd.), as a probe. This DNA fragment had a
restriction enzyme map shown in Fig. 7 and the DNA nucleotide sequence thereof is represented by
SEQ ID No. 13 in Sequence Listing. Based on this nucleotide sequence, it was estimated that the caaur1
gene coded for a protein having the amino acid sequence represented by SEQ ID No. 14 in Sequence
Listing. When compared with the scaur1 s protein, a homology as high as 53% was observed. A
Traplex119 vector having this caaur1 gene integrated therein was named pCAAR1, while E. coli HB101
transformed by this plasmid was named and designated as Escherichia coli HB101/pCAAR1. This strain
has been deposited at National Institute of Bioscience and Human-Technology, Agency of Industrial
Science and Technology in accordance with the Budapest Treaty under the accession number FERM
BP-4482.

Next, pCAAR1 was treated with HindIII to thereby give caaur1 of 4.5 kb. Further, it was integrated
into pTV118 which had been completely digested with HindIII to thereby prepare a plasmid for
expressing caaur1. This plasmid was named pTCAAR1.

(ii) Genomic DNA was extracted and purified from a strain C. albicans TIMM1768 [The Journal of
Antibiotics, 46, 1414-1420 (1993)] by the same method as the one described in Example 1-c). After
partially digesting with HindIII, the DNA fragment was ligated with a pUC118 vector which had been
completely digested with HindIII and transformed into E. coli HB101. Thus a genomic library of C.
albicans TIMM1768 was prepared. From this library, a DNA fragment of 4.5 kb containing the aur1 gene
of C. albicans TIMM1768 was cloned by colony hybridization with the same probe as that described
in Example 6-b)-(i). This DNA fragment had the same restriction enzyme map as that shown in Fig. 7.
Next, a part of the DNA sequence containing an ORF in this DNA fragment was determined. The DNA
nucleotide sequence thereof is represented by SEQ ID No. 21 in Sequence Listing. Based on this
nucleotide sequence, it was estimated that this gene coded for a protein having the amino acid sequence
represented by SEQ ID No. 22 in Sequence Listing. When the amino acid sequence of the caaur1
protein of C. albicans TIMM1768 was compared with that of the caaur1 protein of C. albicans TIMM0136,
the amino acid sequences of the 1- to 381-positions, the 383- to 423-positions and the 425- to
471-positions of the caaur1 protein (SEQ ID No. 14 in Sequence Listing) in C. albicans TIMM0136 were identical
with the amino acid sequences of the 2- to 382-positions, the 384- to 424-positions and the 426- to
472-positions, respectively, of the caaur1 protein (SEQ ID No. 22 in Sequence Listing) in C. albicans
TIMM1768.

However, serines at the 382- and 424-positions of SEQ ID No. 14 in Sequence Listing were replaced
with prolines at the 383- and 425-positions of SEQ ID No. 22 in Sequence Listing.

6-c) Cloning of aur2 gene (caaur2) of C. albicans 

Genomic DNA of a strain C. albicans TIMM0136 was digested with BamHI and ligated with a pTV118
vector which had been completely digested with BamHI. Then it was transformed into E. coli HB101 to
thereby prepare a genomic library of C. albicans. On the other hand, the DNA fragment containing the
scaur2 s gene obtained in Example 2-h) was cleaved with HindIII and PstI to thereby give a DNA fragment of
1.2 kb. This DNA fragment was labeled with [α-32P]dCTP by using a random primer DNA labeling kit. By
using this labeled DNA fragment as a probe, the above-mentioned C. albicans genomic library was
screened by colony hybridization. Thus a plasmid containing a DNA fragment of 8.3 kb was obtained. A
part of the DNA sequence upstream of the BamHI site of this DNA fragment was determined (SEQ ID No.
15 in Sequence Listing). Based on this sequence, an amino acid sequence represented by SEQ ID No. 16
in Sequence Listing was estimated. It corresponded to the amino acid sequence of the 1230- to
1309-positions of the amino acid sequence of the scaur2 gene (SEQ ID No. 10), having a homology as high as
77%. Since this DNA fragment lacked a part of the C-terminus, the genomic library prepared in Example 6-b)
was further screened by using this DNA fragment as a probe. Thus a DNA fragment of 6.5 kb having the
C-terminal part was obtained. Fig. 8 shows the restriction enzyme map of the DNA region containing the
caaur2 gene thus clarified.

A pTV118 vector having the above-mentioned caaur2 gene of 8.3 kb integrated therein was named
pCAAR2N, while E. coli HB101 transformed by this plasmid was named and designated as Escherichia coli
HB101/pCAAR2N. This strain has been deposited at National Institute of Bioscience and Human-Technology,
Agency of Industrial Science and Technology in accordance with the Budapest Treaty under the
accession number FERM BP-4481.

Example 7: Preparation of antibody against protein coded for by scaur1 s gene and staining of S. cerevisiae
cells and detection of said protein by using this antibody

7-a) Preparation of antibody 

SCAR 1-1 (SEQ ID No. 19 in Sequence Listing) comprising a peptide corresponding to the amino acids
at residues 103 to 113 in the amino acid sequence of SEQ ID No. 8 in Sequence Listing having cysteine
added to the N-end thereof and SCAR 1-2 (SEQ ID No. 20 in Sequence Listing) comprising a peptide
corresponding to the amino acids at residues 331 to 348 in the amino acid sequence of SEQ ID No. 8
having cysteine added to the N-end thereof were synthesized by the Fmoc solid phase synthesis method
and purified by reverse phase HPLC. Thus 10 mg portions of these peptides were obtained. To the
N-terminal cysteine of each of these synthetic peptides, KLH was bound as a carrier protein. By using this
binding product as an antigen, a rabbit was immunized and an antiserum was obtained. This antiserum was
further purified on an affinity column prepared by binding the synthetic peptide employed as the antigen to
an agarose gel. Thus a polyclonal antibody specific for the synthetic peptide was prepared.

7-b) Staining of S. cerevisiae cells with antibody 

A strain S. cerevisiae ATCC 9763 was cultured in a YNBG medium [0.67% of yeast nitrogen base
(manufactured by Difco), 2% of glucose] to thereby give a suspension of a concentration of 3 x 10^7
cells/ml. To 1 ml of this cell suspension were added 0.11 ml of a 1 M phosphate buffer (pH 6.5) and 0.17
ml of 37% formaldehyde. After slowly stirring at room temperature for 1 hour, the cells were harvested by
centrifugation and then suspended in 20 ml of an SS buffer (1 M of sorbitol, 0.2% of β-mercaptoethanol,
0.1 M phosphate buffer, pH 7.5) containing 20 µg/ml of Zymolyase 20T. After treating at 30°C for 1 hour,
the cells were harvested, washed with the SS buffer, suspended in 1 ml of the SS buffer containing 0.1% of
Triton X-100 and then allowed to stand for 10 minutes. This cell suspension was placed on a slide glass
which had been coated with poly(L-lysine) and allowed to stand for 10 minutes. Next, a PBS solution
containing 1% of albumin (BSA) was dropped thereon. After allowing to stand at room temperature for 15
minutes, the excess liquid was removed and then a PBS solution containing BSA and 0.02 mg/ml
of the antiSCAR1-1 antibody was dropped thereon. After allowing to stand at room temperature for 60
minutes and washing with PBS containing BSA three times, antirabbit IgG antibody labeled with FITC
(antibody concentration 0.02 mg/ml) was layered over and allowed to stand at room temperature for 1 hour.
After washing with a PBS solution containing BSA, a small amount of a mounting solution, which was a
solution prepared by dissolving 0.1 g of p-phenylenediamine in 10 ml of CBS (150 mM of NaCl, 50 mM of
CHES, pH 9.5), adjusting the pH value to 9.0 with 10 N NaOH and further adding 90 ml of glycerol, was
layered over. Then a cover glass was placed thereon to thereby give a specimen. This specimen was
observed under a fluorescence microscope to thereby examine the intracellular distribution of the scaur1
protein. As a result, it was found that this protein was distributed all over the cells.
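
The fixation step in Example 7-b) states the volumes added but not the resulting concentrations. A minimal sketch of the dilution arithmetic is given below. It assumes that the volumes are simply additive and that the cell suspension itself contributes no formaldehyde or phosphate; the function name is hypothetical.

```python
def final_concentration(stock_concentration, stock_volume_ml, total_volume_ml):
    """Concentration after diluting a stock into the given total volume."""
    return stock_concentration * stock_volume_ml / total_volume_ml

# 1 ml of cell suspension + 0.11 ml of 1 M phosphate buffer + 0.17 ml of 37% formaldehyde
total_ml = 1.0 + 0.11 + 0.17

formaldehyde_percent = final_concentration(37.0, 0.17, total_ml)
phosphate_molar = final_concentration(1.0, 0.11, total_ml)

print(f"formaldehyde: {formaldehyde_percent:.1f} %")
print(f"phosphate:    {phosphate_molar * 1000:.0f} mM")
```

Under these assumptions the cells are fixed in roughly 5% formaldehyde in a phosphate buffer somewhat below 0.1 M.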

7-c) Detection of protein coded for by scaur1 gene by using antibody

The plasmid YEpSCARW3 prepared in Example 5-a) was introduced into a normal haploid S. cerevisiae
SH3328 to thereby give a transformant. This transformant was cultured in a YPGal medium or a YPD
medium and the cells were harvested by centrifugation. The cells thus obtained were suspended in a buffer
(1% of Triton X-100, 1% of SDS, 20 mM of Tris-HCl, pH 7.9, 10 mM of EDTA, 1 mM of DTT, 1 mM of
PMSF). Further, glass beads were added thereto to disrupt the cells by vigorous vortexing. Then an SDS
loading solution was added thereto and the protein was denatured by treating at 95°C for 5 minutes. After
centrifuging, a part of the obtained supernatant was subjected to SDS-PAGE and the protein thus separated
was transferred onto an Immobilon membrane (manufactured by MILLIPORE). This Immobilon membrane
was treated with Block Ace (manufactured by Dainippon Pharmaceutical Co., Ltd.). Then the antiSCAR1-2
antibody prepared in 7-a) was reacted therewith as a primary antibody. After washing, antirabbit IgG
antibody labeled with peroxidase was reacted therewith as a secondary antibody and the membrane was
thoroughly washed. Next, it was color-developed with diaminobenzidine and a band of the scaur1 protein
was detected. Fig. 11 shows the results.

Fig. 11 shows the results of the detection of the protein prepared from the cells incubated in the YPD
medium (lane 1) and the protein prepared from the cells incubated in the YPGal medium (lane 2), each
subjected to SDS-PAGE, by using the antiSCAR1-2 antibody. The cells incubated in the YPGal medium, in
which expression of the scaur1 gene had been induced, showed a specific band.

[Effects of the Invention]

According to the present invention, a novel protein regulating aureobasidin sensitivity and a gene
coding for the protein, i.e., a gene regulating aureobasidin sensitivity are provided. These substances are
useful in the diagnosis and treatment for diseases caused by organisms having the above-mentioned gene,
such as mycoses. The present invention further provides an antisense DNA and an antisense RNA of this
gene, a nucleic acid probe being hybridizable with the gene, a process for detecting the gene by using this
nucleic acid probe, a process for producing a protein regulating aureobasidin sensitivity by using a
transformant having the gene introduced thereinto, an antibody for the protein and a process for detecting
the protein by using this antibody. They are also useful in the diagnosis and treatment of diseases including
mycoses.

Sequence Listing

SEQ ID NO : 1
SEQUENCE LENGTH : 2385
SEQUENCE TYPE : nucleic acid
STRANDEDNESS : double
TOPOLOGY : linear
MOLECULE TYPE : genomic DNA
SEQUENCE DESCRIPTION :
AAGCTTTTTT GCCTCTGCAA AAGTTCCTTT CTCGAATTGG TTTTTTGAGG AAAAGCAAGT 60
TAATAAACTA ATTATATTAT ATATAATTAG CAATTTTATA AAAAAAATAA AAAAATAGCC 120
CTGATTGCTG GCAACTGTGA GCTGAACATT GGTTAATCGG TCCATCTTTT TTTAAATATT 180
TTACATCGCT ACTTTTAAGT GCTTGACACT TGCATTTAAT AGCTACTTTC TTTCCTTCAT 240
AAAAATTCCT TTTTTTTCCT TTAGTTTTCC GGTTAATTCC TTACGAAATT TTTTTCGTAC 300
GCTTCCCTTT TTTACTCTGA TAATTCTTTG AAGCAATGTC TGCTCTTTCG ACCTTAAAAA 360
[illegible] TGCGTGTAAC CGAGCATCCC AATACAAGTT GGAAACAAGC TTAAACCCTA 420
TGCCTACATT TCGTTTGCTA CGCAATACGA AATGGTCATG GACACATTTG CAATATGTGT 480
TTCTAGCAGG TAATTTGATT TTTGCTTGTA TTGTCATTGA ATCTCCTGGA TTCTGGGGGA 540
AATTTGGCAT TGCCTGTCTT TTGGCCATTG CGTTGACCGT TCCTTTAACA CGCCAAATTT 600
TTTTTCCTGC CATTGTTATC ATCACCTGGG CAATTTTATT TTACTCTTGT ACGTTTATTC 660
CAGAACGCTG GCGTCCACCC ATATGGGTTC GTGTTTTACC CACACTTGAA AATATTCTTT 720
ATGGCTCTAA TCTTTCTAGT CTTCTCTCGA AAACCACGCA TAGCATCCTT GATATTTTGG 780
CCTGGGTTCC ATATGGAGTC ATGCATTATT CGGCTCCTTT TATCATTTCA TTTATTCTTT 840
TCATCTTTGC ACCTCCTGGA ACTCTTCCAG TTTGGGCTCG AACTTTTGGT TATATGAATT 900
TATTTGGTGT TCTTATCCAA ATGGCTTTCC CCTGTTCTCC TCCTTGGTAT GAAAATATGT 960
ATGGTTTAGA ACCTGCCACG TATGCAGTAC GTGGCTCTCC TGGTGGATTG GCCCGTATTG 1020
ATGCTCTCTT CGGCACTAGC ATTTACACTG ATTGTTTTTC TAACTCTCCG GTTGTTTTTG 1080
GTGCCTTTCC ATCTCTTCAC GCTGGATGGG CCATGCTGGA AGCACTTTTC CTTTCGCATG 1140
TGTTTCCTCG ATACCGCTTC TGCTTTTATG GATATGTTCT ATGGCTTTGC TGGTGTACTA 1200
TGTACCTTAC CCACCACTAC TTTGTAGATT TGGTCGGCGG TATGTGTTTA GCTATTATAT 1260
GCTTCGTTTT TGCTCAAAAG CTACGCCTCC CACAGTTGCA AACTGGTAAA ATCCTTCGTT 1320
GGGAATACGA GTTTGTTATC CACGGTCATG GTCTTTCCGA AAAAACCAGC AACTCCTTGG 1380
CTCGTACCGG CAGCCCATAC TTACTTGGAA GGGATTCTTT TACTCAAAAC CCTAATGCAG 1440
TAGCCTTCAT GAGTGGTCTT AACAATATGG AACTTGCTAA CACCGATCAT GAATGGTCCG 1500
TGGGTTCATC ATCACCTGAG CCGTTACCTA GTCCTGCTGC TGATTTGATT GATCGTCCTG 1560
CCAGTACCAC TTCCTCCATC TTTGATGCAA GTCATCTTCC TTAAATCAAC GTGCTTTAAG 1620
AATATATTTC CAAAAGCTAC ATGATACATT GACTAGAATC GGTTTGATTC ATAGTGGTAT 1680
TGGAATGATG TTGTTCATTG TGTTTTTTAA CTGTTAATCT GACATCCATT GAGTCATTCT 1740
TTACAATTTG TAAAATTAAT TTGTATCACT AATTTTGAAG GAAGCTATTT TGGTATTAAT 1800
ACCGCTTTTG GTCTCCACTT CCTTTTCGAA ACTCTTAACA GCGATTAGGC CGGGTATCTT 1860
CCAGTGTGAT GTATAGGTAT TTGTCGTTTT TTTATCATTT CCGTTAATAA AGAACTCTTT 1920
TATCCAGCTT CTTACACTGT CAACTGTTGT GAAAGGAACA CATTTAGAAT TTCATTTTCC 1980
TTATTTGTTG TCATTTAAAT CGTTTGACAT AATTTTAAAT TTGGTTTGAA ATGTGTGTGA 2040
GAAGGCTTGT TTTATTCATT TAGTTTATTG CTTGTTTGCA CGAAAATCCA GAACGGAGCA 2100
TTAATGTAAT CCTTTTTTAT TCTGTAAAGC GTTTTTATAC AAATGTTGGT TATACGTTTC 2160
TAAAATAAGA ATATTGTTAT AATAATATAG TTTTTTCTAT CATTTGTTAC ACACACTAAA 2220
GAGACATTAA CGATAAGCAA ATGTGTTAAA ATGATAATAT ATTTTGGAAA CATTTATAAA 2280
GAAATTAAGC AGCTTTGACT AACTACATTT TTGTTTTTTT CCTAAGCAAA ACTGTATAGT 2340
TATACACGCG AGCTGTATTC ACTTCCATTG TAGTGACTTG AGCTC 2385


SEQ ID NO : 2









SEQUENCE LENGTH : 422 
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single 
TOPOLOGY : linear 
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn
1 5 10 15

Arg Ala Ser Gln Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro
20 25 30

Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Ser Trp Thr His Leu
35 40 45

Gln Tyr Val Phe Leu Ala Gly Asn Leu Ile Phe Ala Cys Ile Val
50 55 60

Ile Glu Ser Pro Gly Phe Trp Gly Lys Phe Gly Ile Ala Cys Leu
65 70 75

Leu Ala Ile Ala Leu Thr Val Pro Leu Thr Arg Gln Ile Phe Phe
80 85 90

Pro Ala Ile Val Ile Ile Thr Trp Ala Ile Leu Phe Tyr Ser Cys
95 100 105

Arg Phe Ile Pro Glu Arg Trp Arg Pro Pro Ile Trp Val Arg Val
110 115 120

Leu Pro Thr Leu Glu Asn Ile Leu Tyr Gly Ser Asn Leu Ser Ser
125 130 135

Leu Leu Ser Lys Thr Thr His Ser Ile Leu Asp Ile Leu Ala Trp
140 145 150

Val Pro Tyr Gly Val Met His Tyr Ser Ala Pro Phe Ile Ile Ser
155 160 165

Phe Ile Leu Phe Ile Phe Ala Pro Pro Gly Thr Leu Pro Val Trp
170 175 180

Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gly Val Leu Ile Gln
185 190 195

Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Glu Asn Met Tyr Gly
200 205 210

Leu Glu Pro Ala Thr Tyr Ala Val Arg Gly Ser Pro Gly Gly Leu
215 220 225

Ala Arg Ile Asp Ala Leu Phe Gly Thr Ser Ile Tyr Thr Asp Cys
230 235 240

Phe Ser Asn Ser Pro Val Val Phe Gly Ala Phe Pro Ser Leu His
245 250 255

Ala Gly Trp Ala Met Leu Glu Ala Leu Phe Leu Ser His Val Phe
260 265 270

Pro Arg Tyr Arg Phe Cys Phe Tyr Gly Tyr Val Leu Trp Leu Cys
275 280 285

Trp Cys Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Val
290 295 300

Gly Gly Met Cys Leu Ala Ile Ile Cys Phe Val Phe Ala Gln Lys
305 310 315

Leu Arg Leu Pro Gln Leu Gln Thr Gly Lys Ile Leu Arg Trp Glu
320 325 330

Tyr Glu Phe Val Ile His Gly His Gly Leu Ser Glu Lys Thr Ser
335 340 345

Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Leu Leu Gly Arg Asp
350 355 360

Ser Phe Thr Gln Asn Pro Asn Ala Val Ala Phe Met Ser Gly Leu
365 370 375

Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp Ser Val Gly
380 385 390

Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu Ile
395 400 405

Asp Arg Pro Ala Ser Thr Thr Ser Ser Ile Phe Asp Ala Ser His
410 415 420

Leu Pro

SEQ ID NO : 3 
SEQUENCE LENGTH : 2385 
SEQUENCE TYPE : nucleic acid
STRANDEDNESS : double
TOPOLOGY : linear
MOLECULE TYPE : genomic DNA 
SEQUENCE DESCRIPTION : 
AAGCTTTTTT GCCTCTCCAA AAGTTCCTTT CTCGAATTGG TTTTTTGAGG AAAAGCAAGT 60
TAATAAACTA ATTATATTAT ATATAATTAG CAATTTTATA AAAAAAATAA AAAAATAGCC 120
CTGATTGCTG GCAACTGTGA GCTGAACATT GGTTAATCGG TCCATCTTTT TTTAAATATT 180
TTACATCGCT ACTTTTAAGT GCTTGACACT TGCATTTAAT AGCTACTTTC TTTCCTTCAT 240
AAAAATTCCT TTTTTTTCCT TTAGTTTTCC GGTTAATTCC TTACGAAATT TTTTTCGTAC 300
GCTTCCCTTT TTTACTCTGA TAATTCTTTG AAGCAATGTC TGCTCTTTCG ACCTTAAAAA 360
AGCGCCTTGC TGCGTGTAAC CGAGCATCCC AATACAAGTT GGAAACAAGC TTAAACCCTA 420
TGCCTACATT TCGTTTGCTA CGCAATACGA AATGGTCATG GACACATTTG CAATATGTGT 480
TTCTAGCAGG TAATTTGATT TTTGCTTGTA TTGTCATTGA ATCTCCTGGA TTCTGGGGGA 540
AATTTGGCAT TGCCTGTCTT TTGGCCATTG CGTTGACCGT TCCTTTAACA CGCCAAATTT 600
TTTTTCCTGC CATTGTTATC ATCACCTGGG CAATTTTATT TTACTCTTGT AGGTTTATTC 660
CAGAACGCTG GCGTCCACCC ATATGGGTTC GTGTTTTACC CACACTTGAA AATATTCTTT 720
ATGGCTCTAA TCTTTCTAGT CTTCTCTCGA AAACCACGCA TAGCATCCTT GATATTTTGG 780
CCTGGGTTCC ATATGGAGTC ATGCATTATT CGGCTCCTTT TATCATTTCA TTTATTCTTT 840
TCATCTTTGC ACCTCCTGGA ACTCTTCCAG TTTGGGCTCG AACTTTTGGT TATATGAATT 900
TATTTGGTGT TCTTATCCAA ATGGCTTTCC CCTGTTCTCC TCCTTGGTAT GAAAATATGT 960
ATGGTTTAGA ACCTGCCACG TATGCAGTAC GTGGCTCTCC TGGTGGATTG GCCCGTATTG 1020
ATGCTCTCTT CGGCACTAGC ATTTACACTG ATGGTTTTTC TAACTCTCCG GTTGTTTTTG 1080
GTGCCTTTCC ATCTCTTCAC GCTGGATGGG CCATGCTGGA AGCACTTTTC CTTTCGCATG 1140
TGTTTCCTCG ATACCGCTTC TGCTTTTATG GATATGTTCT ATGGCTTTGC TGGTGTACTA 1200
TGTACCTTAC CCACCACTAC TTTGTAGATT TGGTCGGCGG TATGTGTTTA GCTATTATAT 1260
GCTTCGTTTT TGCTCAAAAG CTACGCCTCC CACAGTTGCA AACTGGTAAA ATCCTTCGTT 1320
GGGAATACGA GTTTGTTATC CACGGTCATG GTCTTTCCGA AAAAACCAGC AACTCCTTGG 1380
CTCGTACCGG CAGCCCATAC TTACTTGGAA GGGATTCTTT TACTCAAAAC CCTAATGCAG 1440
TAGCCTTCAT GAGTGGTCTT AACAATATGG AACTTGCTAA CACCGATCAT GAATGGTCCG 1500
TGGGTTCATC ATCACCTGAG CCGTTACCTA GTCCTGCTGC TGATTTGATT GATCGTCCTG 1560
CCAGTACCAC TTCCTCCATC TTTGATGCAA GTCATCTTCC TTAAATCAAC GTGCTTTAAG 1620
AATATATTTC CAAAAGCTAC ATGATACATT GACTAGAATC GGTTTGATTC ATAGTGGTAT 1680
TGGAATGATG TTGTTCATTG TGTTTTTTAA CTGTTAATCT GACATCCATT GAGTCATTCT 1740
TTACAATTTG TAAAATTAAT TTGTATCACT AATTTTGAAG GAAGCTATTT TGGTATTAAT 1800
ACCGCTTTTG GTCTCCACTT CCTTTTCGAA ACTCTTAACA GCGATTAGGC CGGGTATCTT 1860
CCAGTGTGAT GTATAGGTAT TTGTCGTTTT TTTATCATTT CCGTTAATAA AGAACTCTTT 1920
TATCCAGCTT GTTACACTGT CAACTGTTGT GAAAGGAACA CATTTAGAAT TTCATTTTCC 1980
TTATTTGTTG TGATTTAAAT CGTTTGACAT AATTTTAAAT TTGGTTTGAA ATGTGTGTGA 2040
GAAGGCTTGT TTTATTCATT TAGTTTATTG CTTGTTTGCA CGAAAATCCA GAACGGAGCA 2100
TTAATGTAAT CCTTTTTTAT TCTGTAAAGC GTTTTTATAC AAATGTTGGT TATACGTTTC 2160
TAAAATAAGA ATATTGTTAT AATAATATAG TTTTTTCTAT CATTTGTTAC ACACACTAAA 2220
GAGACATTAA GGATAAGCAA ATGTGTTAAA ATGATAATAT ATTTTGGAAA CATTTATAAA 2280
GAAATTAAGC AGCTTTGACT AACTACATTT TTGTTTTTTT CCTAAGCAAA ACTGTATAGT 2340
TATACACGCG AGCTGTATTC ACTTCCATTG TAGTGACTTG AGCTC 2385

SEQ ID NO : 4 
SEQUENCE LENGTH : 422 
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single 
TOPOLOGY : linear 
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn
1 5 10 15

Arg Ala Ser Gln Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro
20 25 30

Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Ser Trp Thr His Leu
35 40 45

Gln Tyr Val Phe Leu Ala Gly Asn Leu Ile Phe Ala Cys Ile Val
50 55 60

Ile Glu Ser Pro Gly Phe Trp Gly Lys Phe Gly Ile Ala Cys Leu
65 70 75

Leu Ala Ile Ala Leu Thr Val Pro Leu Thr Arg Gln Ile Phe Phe
80 85 90

Pro Ala Ile Val Ile Ile Thr Trp Ala Ile Leu Phe Tyr Ser Cys
95 100 105

Arg Phe Ile Pro Glu Arg Trp Arg Pro Pro Ile Trp Val Arg Val
110 115 120

Leu Pro Thr Leu Glu Asn Ile Leu Tyr Gly Ser Asn Leu Ser Ser
125 130 135

Leu Leu Ser Lys Thr Thr His Ser Ile Leu Asp Ile Leu Ala Trp
140 145 150

Val Pro Tyr Gly Val Met His Tyr Ser Ala Pro Phe Ile Ile Ser
155 160 165

Phe Ile Leu Phe Ile Phe Ala Pro Pro Gly Thr Leu Pro Val Trp
170 175 180

Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gly Val Leu Ile Gln
185 190 195

Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Glu Asn Met Tyr Gly
200 205 210

Leu Glu Pro Ala Thr Tyr Ala Val Arg Gly Ser Pro Gly Gly Leu
215 220 225

Ala Arg Ile Asp Ala Leu Phe Gly Thr Ser Ile Tyr Thr Asp Gly
230 235 240

Phe Ser Asn Ser Pro Val Val Phe Gly Ala Phe Pro Ser Leu His
245 250 255

Ala Gly Trp Ala Met Leu Glu Ala Leu Phe Leu Ser His Val Phe
260 265 270

Pro Arg Tyr Arg Phe Cys Phe Tyr Gly Tyr Val Leu Trp Leu Cys
275 280 285

Trp Cys Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Val
290 295 300

Gly Gly Met Cys Leu Ala Ile Ile Cys Phe Val Phe Ala Gln Lys
305 310 315

Leu Arg Leu Pro Gln Leu Gln Thr Gly Lys Ile Leu Arg Trp Glu
320 325 330

Tyr Glu Phe Val Ile His Gly His Gly Leu Ser Glu Lys Thr Ser
335 340 345

Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Leu Leu Gly Arg Asp
350 355 360

Ser Phe Thr Gln Asn Pro Asn Ala Val Ala Phe Met Ser Gly Leu
365 370 375

Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp Ser Val Gly
380 385 390

Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu Ile
395 400 405

Asp Arg Pro Ala Ser Thr Thr Ser Ser Ile Phe Asp Ala Ser His
410 415 420

Leu Pro
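
SEQ ID NO : 1 and SEQ ID NO : 3 differ in the row ending at position 1080, and SEQ ID NO : 2 and SEQ ID NO : 4 differ at residue 240. The sketch below shows how the two observations can be tied together. It is an illustration only: the two rows are copied from the listings above, the open reading frame is assumed to start at the ATG at position 336 (read off the row ending at 360), and the variable names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

ROW_START = 1021   # position of the first base of the row ending at 1080
ORF_START = 336    # assumed position of the A of the initiating ATG

row_seq1 = "ATGCTCTCTT CGGCACTAGC ATTTACACTG ATTGTTTTTC TAACTCTCCG GTTGTTTTTG".replace(" ", "")
row_seq3 = "ATGCTCTCTT CGGCACTAGC ATTTACACTG ATGGTTTTTC TAACTCTCCG GTTGTTTTTG".replace(" ", "")

# Locate the differing base and convert it to a sequence position and residue number.
diffs = [i for i, (x, y) in enumerate(zip(row_seq1, row_seq3)) if x != y]
position = ROW_START + diffs[0]
residue = (position - ORF_START) // 3 + 1

# Extract the codon containing that base from each row and translate it.
codon_offset = ORF_START + (residue - 1) * 3 - ROW_START
codon_seq1 = row_seq1[codon_offset:codon_offset + 3]
codon_seq3 = row_seq3[codon_offset:codon_offset + 3]

print(position, residue, codon_seq1, CODON_TABLE[codon_seq1], codon_seq3, CODON_TABLE[codon_seq3])
```

With these inputs the single differing base falls in codon 240, which reads as Cys in the row from SEQ ID NO : 1 and as Gly in the row from SEQ ID NO : 3, in agreement with SEQ ID NO : 2 and SEQ ID NO : 4.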



SEQ ID NO : 5 
SEQUENCE LENGTH : 2340 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : double 
TOPOLOGY : linear 






EP 0 644 262 A2 



MOLECULE TYPE : genomic DNA 
SEQUENCE DESCRIPTION : 
TTTCTTTCTG ICAAAGAATA ATAAAGTGCC CATCAGTGTT CATATTTGTT ACAAAGTGGT 60
TTTCTGATTT GGTACTACTG CAGAGGCGTA TTTTTTGCTT CAGTTACCAT AGCGTAAGAA 120
CACTAGCGAC TTTTGTTCGT GAACCAACAG AGTAGGATTT CTACTGCTAC ATCTCTTAGG 180
TAGTTGGTTA GTCCGATCGC TCACTTTTGG TTGTTGTTAA GTACTTCATA AGTTTATCCT 240
TTTCCTTTTT CACACTGAGC TACTTTGGGT ATAGCTTTTG GCCCAAGGAT CTTTGAATTT 300
TCTCCAAAAG TACTTTATTT TATATCCTAC AGGTTGCGGT TTTCATATTT TAAAAAGCTT 360
TTTAATCATT CCTTTGCGTA TGGCAAACCC TTTTTCGAGA TGGTTTCTAT CAGAGAGACC 420
TCCAAACTGC CATGTAGCCG ATTTAGAAAC AAGTTTAGAT CCCCATCAAA CGTTGTTGAA 480
GGTGCAAAAA TACAAACCCG CTTTAAGCGA CTGGGTGCAT TACATCTTCT TGGGATCCAT 540
CATGCTGTTT GTGTTCATTA CTAATCCCGC ACCTTGGATC TTCAAGATCC TTTTTTATTG 600
TTTCTTGGGC ACTTTATTCA TCATTCCAGC TACGTCACAG TTTTTCTTCA ATGCCTTGCC 660
CATCCTAACA TGGGTGGCCC TGTATTTCAC TTCATCGTAC TTTCCAGATG ACCGCAGGCC 720
TCCTATTACT GTCAAAGTGT TACCAGCGGT GGAAACAATT TTATACGGCG ACAATTTAAG 780
TGATATTCTT GCAACATCGA CGAATTCCTT TTTGGACATT TTAGCATGGT TACCGTACGG 840
ACTATTTCAT TATGGGGCCC CATTTGTCGT TGCTGCCATC TTATTCGTAT TTGGTCCACC 900
AACTGTTTTG CAAGGTTATG CTTTTGCATT TGGTTATATG AACCTGTTTG GTGTTATCAT 960
GCAAAATGTC TTTCCAGCCG CTCCCCCATG GTATAAAATT CTCTATGGAT TGCAATCAGC 1020
CAACTATGAT ATGCATGGCT CGCCTGGTGG ATTAGCTAGA ATTGATAAGC TACTCCGTAT 1080
TAATATGTAT ACTACAGCTT TTTCAAATTC CTCCGTCATT TTCGGTGCTT TTCCTTCACT 1140
GCATTCCGGG TGTGCTACTA TGGAAGCCCT GTTTTTCTGT TATTGTTTTC CAAAATTGAA 1200
GCCCTTGTTT ATTGCTTATG TTTGCTGGTT ATGGTGGTCA ACTATGTATC TGACACACCA 1260
TTATTTTGTA GACCTTATGG CAGGTTCTGT GCTGTCATAC GTTATTTTCC AGTACACAAA 1320
GTACACACAT TTACCAATTG TAGATACATC TCTTTTTTGC AGATGGTCAT ACACTTCAAT 1380
TGAGAAATAC GATATATCAA AGAGTGATCC ATTGGCTGCA GATTCAAACG ATATCGAAAG 1440
TGTCCCTTTG TCCAACTTGG AACTTGACTT TGATCTTAAT ATGACTGATG AACCCAGTGT 1500
AAGCCCTTCG TTATTTGATG GATCTACTTC TGTTTCTCGT TCGTCCGCCA CGTCTATAAC 1560
GTCACTAGGT GTAAAGAGGG CTTAATGAGT ATTTTATCTG CAATTACGGA TACGGTTGGT 1620
CTTATGTAGA TACATATAAA TATATATCTT TTTCTTTCTT TTTCTTAGTC AGCATTGTCG 1680
TTTAGCATAA TATACATGTA GTTTATTTAA TCACATACCA CTGATTATCT TTAGAATTTT 1740
ATAAATTTTT GAAATAAATG CGTCGCTTTT AATGGTGTCT ATGTTAAGTG AGGCTTTTAG 1800
AATGCTCTTC CTGCTTTGTT TATTATATGT CTATGAAAGA TATGTATGTA TTTACATGTG 1860
TTTGTAGCGT CCCCAGTCAA AACCTGTGCG CTATACCTAA ATGGATTGAT AATCTTCATT 1920
CACTAATTCT AAAATAGACT TCTTCCCCAA AGAACGGTGT AACGATGAGG CTCTATCCAG 1980
CTGCTTATCT AAATCAACTT TAACGATGGA TGATCTTATG ACACGGGGAT CTTTCTTTAA 2040
AGTTCTTAGA ATTTCAGACT GTACCGCAGC TGATGAATCA AACAGCATTA AAAAGTGATA 2100
TGCTCGAAAA TGTTTTTCCT GGTCTTTCTT CATTATTTTA GGAAGATACC TTATGCCCAT 2160
GGGTACAATG TCCCTCACCA CACCTCTGTT TTGAATAATC AGTTTCCCGA TTGTGGAAGA 2220
CAATTCTTTT GCTTCCAACT TTGGCGCATT GGAGTTGGTT ATGCGAACAA GTCCGATCAG 2280
CTCATAAAGC ATCTTAGTGA AAAGGGTGGT TTTGCGTTAT TCTTTCCTCT GTTGAAGCTT 2340

SEQ ID NO : 6 
SEQUENCE LENGTH : 401 
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single
TOPOLOGY : linear 
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro
1 5 10 15

Asn Cys His Val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gln
20 25 30

Thr Leu Leu Lys Val Gln Lys Tyr Lys Pro Ala Leu Ser Asp Trp
35 40 45

Val His Tyr Ile Phe Leu Gly Ser Ile Met Leu Phe Val Phe Ile
50 55 60

Thr Asn Pro Ala Pro Trp Ile Phe Lys Ile Leu Phe Tyr Cys Phe
65 70 75

Leu Gly Thr Leu Phe Ile Ile Pro Ala Thr Ser Gln Phe Phe Phe
80 85 90

Asn Ala Leu Pro Ile Leu Thr Trp Val Ala Leu Tyr Phe Thr Ser
95 100 105

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro Ile Thr Val Lys Val
110 115 120

Leu Pro Ala Val Glu Thr Ile Leu Tyr Gly Asp Asn Leu Ser Asp
125 130 135

Ile Leu Ala Thr Ser Thr Asn Ser Phe Leu Asp Ile Leu Ala Trp
140 145 150

Leu Pro Tyr Gly Leu Phe His Tyr Gly Ala Pro Phe Val Val Ala
155 160 165

Ala Ile Leu Phe Val Phe Gly Pro Pro Thr Val Leu Gln Gly Tyr
170 175 180

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly Val Ile Met Gln
185 190 195

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys Ile Leu Tyr Gly
200 205 210

Leu Gln Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu
215 220 225

Ala Arg Ile Asp Lys Leu Leu Gly Ile Asn Met Tyr Thr Thr Ala
230 235 240

Phe Ser Asn Ser Ser Val Ile Phe Gly Ala Phe Pro Ser Leu His
245 250 255

Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe
260 265 270

Pro Lys Leu Lys Pro Leu Phe Ile Ala Tyr Val Cys Trp Leu Trp
275 280 285

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Met
290 295 300

Ala Gly Ser Val Leu Ser Tyr Val Ile Phe Gln Tyr Thr Lys Tyr
305 310 315

Thr His Leu Pro Ile Val Asp Thr Ser Leu Phe Cys Arg Trp Ser
320 325 330

Tyr Thr Ser Ile Glu Lys Tyr Asp Ile Ser Lys Ser Asp Pro Leu
335 340 345

Ala Ala Asp Ser Asn Asp Ile Glu Ser Val Pro Leu Ser Asn Leu
350 355 360

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser Val Ser
365 370 375

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala
380 385 390

Thr Ser Ile Thr Ser Leu Gly Val Lys Arg Ala
395 400



SEQ ID NO : 7
SEQUENCE LENGTH : 2340
SEQUENCE TYPE : nucleic acid
STRANDEDNESS : double
TOPOLOGY : linear
MOLECULE TYPE : genomic DNA
SEQUENCE DESCRIPTION :

TTTCTTTCTC TCAAAGAATA ATAAAGTGCC CATCAGTGTT CATATTTGTT ACAAAGTGGT 60
TTTCTGATTT GGTACTACTG CAGAGGCGTA TTTTTTGCTT CAGTTACCAT AGCGTAAGAA 120
CACTAGCGAC TTTTGTTCGT GAACCAACAG AGTAGGATTT CTACTGCTAC ATCTCTTAGG 180
TAGTTGGTTA GTCCGATCGC TCACTTTTGG TTGTTGTTAA GTACTTCATA AGTTTATCCT 240
TTTCCTTTTT CACACTGAGC TACTTTGGGT ATAGCTTTTC GCCCAAGGAT CTTTGAATTT 300
TCTCCAAAAG TACTTTATTT TATATCCTAC AGGTTGCGGT TTTCATATTT TAAAAAGCTT 360
TTTAATCATT CCTTTGCCTA TGGCAAACCC TTTTTCGAGA TGGTTTCTAT CAGAGAGACC 420
TCCAAACTGC CATGTAGCCG ATTTAGAAAC AAGTTTAGAT CCCCATCAAA CGTTGTTGAA 480
GGTGCAAAAA TACAAACCCG CTTTAAGCGA CTGGGTGCAT TACATCTTCT TGGGATCCAT 540
CATGCTGTTT GTGTTCATTA CTAATCCCGC ACCTTGGATC TTCAAGATCC TTTTTTATTG 600
TTTCTTGGGC ACTTTATTCA TCATTCCAGC TACGTCACAG TTTTTCTTCA ATGCCTTGCC 660
CATCCTAACA TGGGTGGCGC TGTATTTCAC TTCATCGTAC TTTCCACATG ACCGCAGGCC 720
TCCTATTACT GTCAAAGTGT TACCAGCGGT GGAAACAATT TTATACGGCG ACAATTTAAG 780
TGATATTCTT GCAACATCGA CGAATTCCTT TTTGGACATT TTAGCATGGT TACCGTACGG 840
ACTATTTCAT TTTGGGGCCC CATTTGTCGT TGCTGCCATC TTATTCGTAT TTGGTCCACC 900
AACTGTTTTG CAAGGTTATG CTTTTGCATT TGGTTATATG AACCTGTTTG GTGTTATCAT 960
GCAAAATGTC TTTCCAGCCG CTCCCCCATG GTATAAAATT CTCTATGGAT TGCAATCAGC 1020
CAACTATGAT ATGCATGGCT CGCCTGGTGG ATTAGCTAGA ATTGATAAGC TACTCGGTAT 1080
TAATATCTAT ACTACAGCTT TTTCAAATTC CTCCGTCATT TTCGGTGCTT TTCCTTCACT 1140
GCATTCCGGG TGTGCTACTA TGGAAGCCCT GTTTTTCTGT TATTGTTTTC CAAAATTGAA 1200
GCCCTTGTTT ATTGCTTATG TTTGCTGGTT ATGGTGGTCA ACTATGTATC TGACACACCA 1260
TTATTTTGTA GACCTTATGG CAGGTTCTGT GCTGTCATAC GTTATTTTCC AGTACACAAA 1320
GTACACACAT TTACCAATTG TAGATACATC TCTTTTTTGC AGATGGTCAT ACACTTCAAT 1380
TGAGAAATAC GATATATCAA AGAGTGATCC ATTGGCTGCA GATTCAAACG ATATCGAAAG 1440
TGTCCCTTTG TCCAACTTGG AACTTGACTT TGATCTTAAT ATGACTGATG AACCCAGTGT 1500
AAGCCCTTCG TTATTTGATG GATCTACTTC TGTTTCTCGT TCGTCCGCCA CGTCTATAAC 1560
GTCACTAGGT GTAAAGAGGG CTTAATGAGT ATTTTATCTG CAATTACGGA TACGGTTGGT 1620
CTTATGTAGA TACATATAAA TATATATCTT TTTCTTTCTT TTTCTTAGTC AGGATTGTCG 1680
TTTAGCATAA TATACATGTA GTTTATTTAA TCACATACCA CTGATTATCT TTAGAATTTT 1740
ATAAATTTTT GAAATAAATG GGTGGCTTTT AATGGTGTCT ATGTTAAGTG AGGCTTTTAG 1800
AATGCTCTTC CTGCTTTGTT TATTATATGT CTATGAAAGA TATGTATGTA TTTACATGTG 1860
TTTGTAGCGT CCCCAGTCAA AACCTGTGCG CTATACCTAA ATGGATTGAT AATCTTCATT 1920
CACTAATTCT AAAATAGACT TCTTCCCCAA AGAACGGTGT AACGATGAGG CTCTATCCAG 1980
CTGCTTATCT AAATCAACTT TAACGATGGA TGATCTTATG ACACGGGGAT CTTTCTTTAA 2040
AGTTCTTAGA ATTTCAGACT GTACCGCAGC TGATGAATCA AACAGCATTA AAAAGTGATA 2100
TGCTCGAAAA TGTTTTTCCT GGTCTTTCTT CATTATTTTA GGAAGATACC TTATGCCCAT 2160
CGGTACAATG TCCCTCACCA CACCTCTGTT TTGAATAATC AGTTTCCCGA TTGTGGAAGA 2220
CAATTCTTTT GCTTCCAACT TTGGCGCATT GGAGTTGGTT ATGCGAACAA GTCCGATCAG 2280
CTCATAAAGC ATCTTAGTGA AAAGGGTGGT TTTGCGTTAT TCTTTCCTCT GTTGAAGCTT 2340

SEQ ID NO : 8 
SEQUENCE LENGTH : 401 
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single
TOPOLOGY : linear
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro
1 5 10 15

Asn Cys His Val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gln
20 25 30

Thr Leu Leu Lys Val Gln Lys Tyr Lys Pro Ala Leu Ser Asp Trp
35 40 45

Val His Tyr Ile Phe Leu Gly Ser Ile Met Leu Phe Val Phe Ile
50 55 60

Thr Asn Pro Ala Pro Trp Ile Phe Lys Ile Leu Phe Tyr Cys Phe
65 70 75

Leu Gly Thr Leu Phe Ile Ile Pro Ala Thr Ser Gln Phe Phe Phe
80 85 90

Asn Ala Leu Pro Ile Leu Thr Trp Val Ala Leu Tyr Phe Thr Ser
95 100 105

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro Ile Thr Val Lys Val
110 115 120

Leu Pro Ala Val Glu Thr Ile Leu Tyr Gly Asp Asn Leu Ser Asp
125 130 135

Ile Leu Ala Thr Ser Thr Asn Ser Phe Leu Asp Ile Leu Ala Trp
140 145 150

Leu Pro Tyr Gly Leu Phe His Phe Gly Ala Pro Phe Val Val Ala
155 160 165

Ala Ile Leu Phe Val Phe Gly Pro Pro Thr Val Leu Gln Gly Tyr
170 175 180

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly Val Ile Met Gln
185 190 195

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys Ile Leu Tyr Gly
200 205 210

Leu Gln Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu
215 220 225

Ala Arg Ile Asp Lys Leu Leu Gly Ile Asn Met Tyr Thr Thr Ala
230 235 240

Phe Ser Asn Ser Ser Val Ile Phe Gly Ala Phe Pro Ser Leu His
245 250 255

Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe
260 265 270

Pro Lys Leu Lys Pro Leu Phe Ile Ala Tyr Val Cys Trp Leu Trp
275 280 285

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Met
290 295 300

Ala Gly Ser Val Leu Ser Tyr Val Ile Phe Gln Tyr Thr Lys Tyr
305 310 315

Thr His Leu Pro Ile Val Asp Thr Ser Leu Phe Cys Arg Trp Ser
320 325 330

Tyr Thr Ser Ile Glu Lys Tyr Asp Ile Ser Lys Ser Asp Pro Leu
335 340 345

Ala Ala Asp Ser Asn Asp Ile Glu Ser Val Pro Leu Ser Asn Leu
350 355 360

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser Val Ser
365 370 375

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala
380 385 390

Thr Ser Ile Thr Ser Leu Gly Val Lys Arg Ala
395 400

SEQ ID NO : 9 
SEQUENCE LENGTH : 5340 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : double 
TOPOLOGY : linear 
MOLECULE TYPE : genomic DNA 
SEQUENCE DESCRIPTION : 



AGCGCTTCTA TTTTCCTCCC CACCGCGACG CGGAAATGGC ACATTTTTTT TCTTTTGCTT 60
CTGTGCTTTT GCTGTAATTT TTGGCATCTG CTATTGTATG AAGATAACGC GTGGTTCCGT 120
GGAAATACCC GGAAATTTTG CCGGGAATAT GACGGACATG ATTTAACACC CGTGGAAATG 180
AAAAAAGCCA AGGTAAGAAA GTGGCAATAT TTTTCCTACA AATAGATCTG CTGTCCCTTA 240
GATGATTACC ATACATATAT ATATTTATTA CACACATCTG TCAGAGGTAG CTAGCGAAGG 300
TGTCACTGAA ATATTTTTTG TTCCAGTTAG TATAAATACG GAGGTAGAAC AGCTCTCCGC 360
GTGTATATCT TTTTTTGCGC TATACAAGAA CAGGAAGAAC GCATTTCCAT ACCTTTTTCT 420
CCTTACAGGT GCCCTCTGAG TAGTGTCACG AACGAGGAAA AAGATTAATA TTACTGTTTT 480
TATATTCAAA AAGAGTAAAG CCGTTGCTAT ATACGAATAT GACGATTACC GTGGGGGATG 540
CAGTTTCGGA GACGGAGCTG GAAAACAAAA GTCAAAACGT GGTACTATCT CCCAAGGCAT 600
CTCCTTCTTC AGACATAAGC ACAGATGTTG ATAAAGACAC ATCGTCTTCT TGGGATGACA 660
AATCTTTGCT GCCTACAGGT GAATATATTG TGGACAGAAA TAAGCCCCAA ACCTACTTGA 720
ATAGCGATGA TATCGAAAAA GTGACAGAAT CTGATATTTT CCCTCAGAAA CGTCTGTTTT 780
CATTCTTGCA CTCTAAGAAA ATTCCAGAAG TACCACAAAC CGATGACGAG AGGAAGATAT 840
ATCCTCTGTT CCATACAAAT ATTATCTCTA ACATGTTTTT TTGGTGGGTT CTACCCATCC 900
TGCGAGTTGG TTATAAGAGA ACGATACAGC CGAACGATCT CTTCAAAATG GATCCGAGGA 960
TGTCTATAGA GACCCTTTAT GACGACTTTG AAAAAAACAT GATTTACTAT TTTGAGAAGA 1020
CGAGGAAAAA ATACCGTAAA AGACATCCAG AAGCGACAGA AGAAGAGGTT ATGGAAAATG 1080
CCAAACTACC TAAACATACA GTTCTGAGAG CTTTATTATT CACTTTTAAG AAACAGTACT 1140
TCATGTCGAT AGTGTTTGCA ATTCTCGCTA ATTGTACATC CGGTTTTAAC CCCATGATTA 1200
CCAAGAGGCT AATTGAGTTT GTCGAAGAAA AGGCTATTTT TCATAGCATG CATGTTAACA 1260
AAGGTATTGG TTACGCTATT GGTGCATGTT TGATGATGTT CGTTAACGGG TTGACGTTCA 1320
ATCATTTCTT TCATACATCC CAACTGACTG GTGTGCAAGC TAAGTCTATT CTTACTAAAG 1380
CTGCCATGAA GAAAATGTTT AATGCATCTA ATTATGCGAG ACATTGTTTT CCTAACGGTA 1440
AAGTGACTTC TTTTGTAACA ACAGATCTCG CTAGAATTGA ATTTGCCTTA TCTTTTCAGC 1500
CGTTTTTGGC TGGGTTCCCT GCAATTTTGG CTATTTGCAT TGTTTTATTG ATCGTTAACC 1560
TTGGACCCAT TGCCTTAGTT GGGATTGGTA TTTTTTTCGG TGGGTTTTTC ATATCCTTAT 1620
TTGCATTTAA GTTAATTCTG GGCTTTAGAA TTGCTGCGAA CATCTTCACT GATGCTAGAG 1680
TTACCATGAT GAGAGAAGTG CTGAATAATA TAAAAATGAT TAAATATTAT ACGTGGGAGG 1740
ATGCGTATGA AAAAAATATT CAAGATATTA GGACCAAAGA GATTTCTAAA GTTAGAAAAA 1800
TGCAACTATC AAGAAATTTC TTGATTGCTA TGGCCATGTC TTTGCCTAGT ATTGCTTCAT 1860
TGGTCACTTT CCTTGCAATG TACAAAGTTA ATAAAGGAGG CAGGCAACCT GGTAATATTT 1920
TTGCCTCTTT ATCTTTATTT CAGGTCTTGA GTTTGCAAAT GTTTTTCTTA CCTATTGCTA 1980
TTGGTACTGG AATTGACATG ATCATTGGAT TGGGCCGTTT GCAAAGCTTA TTGGAGGCTC 2040
CAGAAGATGA TCCAAATCAG ATGATTGAAA TGAAGCCCTC TCCTGGCTTT GATCCAAAAT 2100
TGGCTCTAAA AATGACACAT TGCTCATTTG AGTGGGAAGA TTATGAATTA AACGACGCTA 2160
TTGAAGAAGC AAAAGGAGAA GCTAAAGATG AAGGTAAAAA GAACAAAAAA AAGCGTAAGG 2220
ATACATGGGG TAAGCCATCT GCAAGTACTA ATAAGGCGAA AAGATTGGAC AATATGTTGA 2280
AAGACAGAGA CGGCCCGGAA GATTTAGAAA AAACTTCGTT TAGGGGTTTC AAGGACTTGA 2340
ACTTCGATAT TAAAAAGGGC GAATTTATTA TGATTACGGG ACCTATTGGT ACTGGTAAAT 2400
CTTCATTATT GAATGCGATG GCAGGATCAA TGAGAAAAAT TGATGGTAAG GTTGAAGTCA 2460
ACGCCGACTT ATTAATGTGT GGTTATCCAT GGATTCAAAA TGCATCTGTA AGAGATAACA 2520
TCATATTCGG TTCACCATTC AATAAAGAAA AGTATGATGA AGTAGTTCGT GTTTGCTCTT 2580
TGAAAGCTGA TCTGGATATT TTACCGGCAG GCGATATGAC CGAAATTGGG GAACGTGGTA 2640
TTACTTTATC TGGTGGTCAA AAGGCACGTA TCAATTTAGC CAGGTCTGTT TATAAGAAGA 2700
AGGATATTTA TGTATTCGAC GATGTCCTAA GTGCTGTCGA TTCTCGTGTT GGTAAACACA 2760
TCATGGATGA ATGTCTAACC GGAATGCTTG CTAATAAAAC CAGAATTTTA GCAACGCATC 2820
AGTTGTCACT GATTGAGAGA GCTTCTAGAG TCATCGTTTT AGGTACTGAT GGCCAAGTCG 2880
ATATTGGTAC TGTTGATGAG CTAAAAGCTC GTAATCAAAC TTTGATAAAT CTTTTACAAT 2940
TCTCTTCTCA AAATTCGGAG AAAGAGGATG AAGAACAGGA AGCGGTTGTT TCCCGTGAAT 3000
TGGGACAACT AAAATATGAA CCAGAGGTAA AGGAATTGAC TGAACTGAAG AAAAAGGCTA 3060
CAGAAATGTC ACAAACTGCA AATAGTGGTA AAATTGTACC GGATGGTCAT ACTAGTAGTA 3120
AAGAAGAAAG AGCAGTCAAT AGTATCAGTC TGAAAATATA CCGTGAATAC ATTAAAGCTG 3180
CAGTAGGTAA GTGGGGTTTT ATCGCACTAC CGTTGTATGC AATTTTAGTC GTTGGAACCA 3240
CATTCTGCTC ACTTTTTTCT TCCGTTTGGT TATCTTACTG GACTGAGAAT AAATTCAAAA 3300
ACAGACCACC CAGTTTTTAT ATGGGTCTTT ACTCCTTCTT TGTGTTTGCT GCTTTCATAT 3360
TCATGAATGG CCAGTTCACC ATACTTTGCG CAATGGGTAT TATGGCATCG AAATGGTTAA 3420
ATTTGAGGGC TGTGAAAAGA ATTTTACACA CTCCAATGTC ATACATAGAT ACCACACCTT 3480
TGGGACGTAT TCTGAACAGA TTCACAAAAG ATACAGATAG CTTAGATAAT GAGTTAACCG 3540
AAAGTTTACG GTTGATGACA TCTCAATTTG CTAATATTGT AGGTGTTTGC GTCATGTGTA 3600
TTGTTTACTT GCCGTGGTTT GCTATCGCAA TTCCGTTTCT TTTGGTCATC TTTGTTCTGA 3660
TTGCTGATCA TTATCAGAGT TCTGGTAGAG AAATTAAAAG ACTTGAAGCT GTGCAACGGT 3720
CTTTTGTTTA CAATAATTTA AATGAAGTTT TGGGTGGGAT GGATACAATC AAAGCATACC 3780


GAAGTCAGGA ACGATTTTTG GCGAAATCAG ATTTTTTGAT CAACAAGATG AATGAGGCGG 3840


GATACCTTGT AGTTGTCCTG CAAAGATGGG TAGGTATTTT CCTTGATATG GTTGCTATCG 3900
CATTTGCACT AATTATTACG TTATTGTGTG TTACGAGAGC CTTTCCTATT TCCGCGGCTT 3960
CAGTTGGTGT TTTGTTGACT TATGTATTAC AATTGCCTGG TCTATTAAAT ACCATTTTAA 4020
GGGCAATGAC TCAAACAGAG AATGACATGA ATAGTGCCGA AAGATTGGTA ACATATGCAA 4080
CTGAACTACC ACTAGAGGCA TCCTATAGAA AGCCCGAAAT GACACCTCCA GAGTCATGGC 4140
CCTCAATGGG CGAAATAATT TTTGAAAATG TTGATTTTGC CTATAGACCT GGTTTACCTA 4200
TACTTTTAAA AAATCTTAAC TTGAATATCA AGAGTGGGGA AAAAATTGGT ATCTGTGGTC 4260
GTACAGGTGC TGGTAAGTCC ACTATTATGA GTGCCCTTTA CAGGTTGAAT GAATTGACCG 4320
CAGGTAAAAT TTTAATTGAC AATGTTGATA TAAGTCAGCT GGGACTTTTC GATTTAAGAA 4380
GAAAATTAGC CATCATTCCA CAAGATCCAG TATTATTTAG GGGTACGATT CGCAACAACT 4440
TAGATCCATT TAATGAGCGT ACAGATCACG AATTATGGGA TGCATTGGTG AGAGGTGGTG 4500
CTATCGCCAA GGATGACTTG CCGGAAGTGA AATTGCAAAA ACCTGATGAA AATGGTACTC 4560
ATGGTAAAAT GCATAAGTTC CATTTAGATC AAGCAGTGGA AGAAGAGGGC TCCAATTTCT 4620
CCTTAGGTGA GAGACAACTA TTAGCATTAA CAAGGGCATT GGTCCGCCAA TCAAAAATAT 4680
TGATTTTGGA TGAGGCTACA TCCTCAGTGG ACTACGAAAC GGATGGCAAA ATCCAAACAC 4740
GTATTGTTGA GGAATTTGGA GATTGTACAA TTTTGTGTAT TGCTCACAGA CTGAAGACCA 4800
TTGTAAATTA TGATCGTATT CTTGTTTTAG AGAAGGGTGA AGTCGCAGAA TTCGATACAC 4860
CATGGACGTT GTTTAGTCAA GAAGATAGTA TTTTCAGAAG CATGTGTTCT ACATCTCCTA 4920


TTGTGGAAAA TGATTTCGAG AACAGAAGTT AATTTATATT ATTTftTTf.f A TP ATTTTTPT 4980


CTTTTATTTA TTTATATGTT GCCGATGGTA CAAATTAGTA CTAGAAAAGA AAACCCACTA 5040
CTATGACTTG CAGAAAAAGT TATGTGTGCC ATAGATAGAT ATAATTGCAT ACCCACATCG 5100
TATACTCAAA ATTCCGAAAA GAACATTTCA TTTTTTATGA GGCAAACTGA ACAACGCTTC 5160
GGTCCTTTTT TCATTCTAGA AATATATATT TATACATCAT TTTCAGAAGA TATTCAAAGA 5220
ACTTATTGGG ATGTCTATTT ACTGAATAAA GTATACACAA AAAACGAATT TAAAATGGAA 5280
GGCATAAATA CAAAACTTAC AAGTGAAAAT CCTAAAACCG AAGGATATTT CAAATACGTA 5340



SEQ ID NO : 10 
SEQUENCE LENGTH : 1477 
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single 
TOPOLOGY : linear 
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Met Thr Ile Thr Val Gly Asp Ala Val Ser Glu Thr Glu Leu Glu
5 10 15

Asn Lys Ser Gln Asn Val Val Leu Ser Pro Lys Ala Ser Ala Ser
20 25 30

Ser Asp Ile Ser Thr Asp Val Asp Lys Asp Thr Ser Ser Ser Trp
35 40 45

Asp Asp Lys Ser Leu Leu Pro Thr Gly Glu Tyr Ile Val Asp Arg
50 55 60

Asn Lys Pro Gln Thr Tyr Leu Asn Ser Asp Asp Ile Glu Lys Val
65 70 75

Thr Glu Ser Asp Ile Phe Pro Gln Lys Arg Leu Phe Ser Phe Leu
80 85 90

His Ser Lys Lys Ile Pro Glu Val Pro Gln Thr Asp Asp Glu Arg
95 100 105

Lys Ile Tyr Pro Leu Phe His Thr Asn Ile Ile Ser Asn Met Phe
110 115 120

Phe Trp Trp Val Leu Pro Ile Leu Arg Val Gly Tyr Lys Arg Thr
125 130 135

Ile Gln Pro Asn Asp Leu Phe Lys Met Asp Pro Arg Met Ser Ile
140 145 150

Glu Thr Leu Tyr Asp Asp Phe Glu Lys Asn Met Ile Tyr Tyr Phe
155 160 165

Glu Lys Thr Arg Lys Lys Tyr Arg Lys Arg His Pro Glu Ala Thr
170 175 180

Glu Glu Glu Val Met Glu Asn Ala Lys Leu Pro Lys His Thr Val
185 190 195

Leu Arg Ala Leu Leu Phe Thr Phe Lys Lys Gln Tyr Phe Met Ser
200 205 210

Ile Val Phe Ala Ile Leu Ala Asn Cys Thr Ser Gly Phe Asn Pro
215 220 225

Met Ile Thr Lys Arg Leu Ile Glu Phe Val Glu Glu Lys Ala Ile
230 235 240

Phe His Ser Met His Val Asn Lys Gly Ile Gly Tyr Ala Ile Gly
245 250 255

Ala Cys Leu Met Met Phe Val Asn Gly Leu Thr Phe Asn His Phe
260 265 270

Phe His Thr Ser Gln Leu Thr Gly Val Gln Ala Lys Ser Ile Leu
275 280 285

Thr Lys Ala Ala Met Lys Lys Met Phe Asn Ala Ser Asn Tyr Ala
290 295 300

Arg His Cys Phe Pro Asn Gly Lys Val Thr Ser Phe Val Thr Thr
305 310 315

Asp Leu Ala Arg Ile Glu Phe Ala Leu Ser Phe Gln Pro Phe Leu
320 325 330

Ala Gly Phe Pro Ala Ile Leu Ala Ile Cys Ile Val Leu Leu Ile
335 340 345

Val Asn Leu Gly Pro Ile Ala Leu Val Gly Ile Gly Ile Phe Phe
350 355 360

Gly Gly Phe Phe Ile Ser Leu Phe Ala Phe Lys Leu Ile Leu Gly
365 370 375

Phe Arg Ile Ala Ala Asn Ile Phe Thr Asp Ala Arg Val Thr Met
380 385 390

Met Arg Glu Val Leu Asn Asn Ile Lys Met Ile Lys Tyr Tyr Thr
395 400 405

Trp Glu Asp Ala Tyr Glu Lys Asn Ile Gln Asp Ile Arg Thr Lys
410 415 420

Glu Ile Ser Lys Val Arg Lys Met Gln Leu Ser Arg Asn Phe Leu
425 430 435

Ile Ala Met Ala Met Ser Leu Pro Ser Ile Ala Ser Leu Val Thr
440 445 450

Phe Leu Ala Met Tyr Lys Val Asn Lys Gly Gly Arg Gln Pro Gly
455 460 465

Asn Ile Phe Ala Ser Leu Ser Leu Phe Gln Val Leu Ser Leu Gln
470 475 480

Met Phe Phe Leu Pro Ile Ala Ile Gly Thr Gly Ile Asp Met Ile
485 490 495

Ile Gly Leu Gly Arg Leu Gln Ser Leu Leu Glu Ala Pro Glu Asp
500 505 510

Asp Pro Asn Gln Met Ile Glu Met Lys Pro Ser Pro Gly Phe Asp
515 520 525

Pro Lys Leu Ala Leu Lys Met Thr His Cys Ser Phe Glu Trp Glu
530 535 540

Asp Tyr Glu Leu Asn Asp Ala Ile Glu Glu Ala Lys Gly Glu Ala
545 550 555

Lys Asp Glu Gly Lys Lys Asn Lys Lys Lys Arg Lys Asp Thr Trp
560 565 570

Gly Lys Pro Ser Ala Ser Thr Asn Lys Ala Lys Arg Leu Asp Asn
575 580 585

Met Leu Lys Asp Arg Asp Gly Pro Glu Asp Leu Glu Lys Thr Ser
590 595 600

Phe Arg Gly Phe Lys Asp Leu Asn Phe Asp Ile Lys Lys Gly Glu
605 610 615

Phe Ile Met Ile Thr Gly Pro Ile Gly Thr Gly Lys Ser Ser Leu
620 625 630

Leu Asn Ala Met Ala Gly Ser Met Arg Lys Ile Asp Gly Lys Val
635 640 645

Glu Val Asn Gly Asp Leu Leu Met Cys Gly Tyr Pro Trp Ile Gln
650 655 660

Asn Ala Ser Val Arg Asp Asn Ile Ile Phe Gly Ser Pro Phe Asn
665 670 675

Lys Glu Lys Tyr Asp Glu Val Val Arg Val Cys Ser Leu Lys Ala
680 685 690

Asp Leu Asp Ile Leu Pro Ala Gly Asp Met Thr Glu Ile Gly Glu
695 700 705

Arg Gly Ile Thr Leu Ser Gly Gly Gln Lys Ala Arg Ile Asn Leu
710 715 720

Ala Arg Ser Val Tyr Lys Lys Lys Asp Ile Tyr Val Phe Asp Asp
725 730 735

Val Leu Ser Ala Val Asp Ser Arg Val Gly Lys His Ile Met Asp
740 745 750

Glu Cys Leu Thr Gly Met Leu Ala Asn Lys Thr Arg Ile Leu Ala
755 760 765

Thr His Gln Leu Ser Leu Ile Glu Arg Ala Ser Arg Val Ile Val
770 775 780

Leu Gly Thr Asp Gly Gln Val Asp Ile Gly Thr Val Asp Glu Leu
785 790 795

Lys Ala Arg Asn Gln Thr Leu Ile Asn Leu Leu Gln Phe Ser Ser
800 805 810

Gln Asn Ser Glu Lys Glu Asp Glu Glu Gln Glu Ala Val Val Ser
815 820 825

Gly Glu Leu Gly Gln Leu Lys Tyr Glu Pro Glu Val Lys Glu Leu
830 835 840

Thr Glu Leu Lys Lys Lys Ala Thr Glu Met Ser Gln Thr Ala Asn
845 850 855

Ser Gly Lys Ile Val Ala Asp Gly His Thr Ser Ser Lys Glu Glu
860 865 870

Arg Ala Val Asn Ser Ile Ser Leu Lys Ile Tyr Arg Glu Tyr Ile
875 880 885

Lys Ala Ala Val Gly Lys Trp Gly Phe Ile Ala Leu Pro Leu Tyr
890 895 900

Ala Ile Leu Val Val Gly Thr Thr Phe Cys Ser Leu Phe Ser Ser
905 910 915

Val Trp Leu Ser Tyr Trp Thr Glu Asn Lys Phe Lys Asn Arg Pro
920 925 930

Pro Ser Phe Tyr Met Gly Leu Tyr Ser Phe Phe Val Phe Ala Ala
935 940 945

Phe Ile Phe Met Asn Gly Gln Phe Thr Ile Leu Cys Ala Met Gly
950 955 960

Ile Met Ala Ser Lys Trp Leu Asn Leu Arg Ala Val Lys Arg Ile
965 970 975

Leu His Thr Pro Met Ser Tyr Ile Asp Thr Thr Pro Leu Gly Arg
980 985 990

Ile Leu Asn Arg Phe Thr Lys Asp Thr Asp Ser Leu Asp Asn Glu
995 1000 1005

Leu Thr Glu Ser Leu Arg Leu Met Thr Ser Gln Phe Ala Asn Ile
1010 1015 1020

Val Gly Val Cys Val Met Cys Ile Val Tyr Leu Pro Trp Phe Ala
1025 1030 1035

Ile Ala Ile Pro Phe Leu Leu Val Ile Phe Val Leu Ile Ala Asp
1040 1045 1050

His Tyr Gln Ser Ser Gly Arg Glu Ile Lys Arg Leu Glu Ala Val
1055 1060 1065

Gln Arg Ser Phe Val Tyr Asn Asn Leu Asn Glu Val Leu Gly Gly
1070 1075 1080

Met Asp Thr Ile Lys Ala Tyr Arg Ser Gln Glu Arg Phe Leu Ala
1085 1090 1095

Lys Ser Asp Phe Leu Ile Asn Lys Met Asn Glu Ala Gly Tyr Leu
1100 1105 1110

Val Val Val Leu Gln Arg Trp Val Gly Ile Phe Leu Asp Met Val
1115 1120 1125

Ala Ile Ala Phe Ala Leu Ile Ile Thr Leu Leu Cys Val Thr Arg
1130 1135 1140

Ala Phe Pro Ile Ser Ala Ala Ser Val Gly Val Leu Leu Thr Tyr
1145 1150 1155

Val Leu Gln Leu Pro Gly Leu Leu Asn Thr Ile Leu Arg Ala Met
1160 1165 1170

Thr Gln Thr Glu Asn Asp Met Asn Ser Ala Glu Arg Leu Val Thr
1175 1180 1185

Tyr Ala Thr Glu Leu Pro Leu Glu Ala Ser Tyr Arg Lys Pro Glu
1190 1195 1200

Met Thr Pro Pro Glu Ser Trp Pro Ser Met Gly Glu Ile Ile Phe
1205 1210 1215

Glu Asn Val Asp Phe Ala Tyr Arg Pro Gly Leu Pro Ile Val Leu
1220 1225 1230

Lys Asn Leu Asn Leu Asn Ile Lys Ser Gly Glu Lys Ile Gly Ile
1235 1240 1245

Cys Gly Arg Thr Gly Ala Gly Lys Ser Thr Ile Met Ser Ala Leu
1250 1255 1260

Tyr Arg Leu Asn Glu Leu Thr Ala Gly Lys Ile Leu Ile Asp Asn
1265 1270 1275

Val Asp Ile Ser Gln Leu Gly Leu Phe Asp Leu Arg Arg Lys Leu
1280 1285 1290

Ala Ile Ile Pro Gln Asp Pro Val Leu Phe Arg Gly Thr Ile Arg
1295 1300 1305

Lys Asn Leu Asp Pro Phe Asn Glu Arg Thr Asp Asp Glu Leu Trp
1310 1315 1320

Asp Ala Leu Val Arg Gly Gly Ala Ile Ala Lys Asp Asp Leu Pro
1325 1330 1335

Glu Val Lys Leu Gln Lys Pro Asp Glu Asn Gly Thr His Gly Lys
1340 1345 1350

Met His Lys Phe His Leu Asp Gln Ala Val Glu Glu Glu Gly Ser
1355 1360 1365

Asn Phe Ser Leu Gly Glu Arg Gln Leu Leu Ala Leu Thr Arg Ala
1370 1375 1380

Leu Val Arg Gln Ser Lys Ile Leu Ile Leu Asp Glu Ala Thr Ser
1385 1390 1395

Ser Val Asp Tyr Glu Thr Asp Gly Lys Ile Gln Thr Arg Ile Val
1400 1405 1410

Glu Glu Phe Gly Asp Cys Thr Ile Leu Cys Ile Ala His Arg Leu
1415 1420 1425

Lys Thr Ile Val Asn Tyr Asp Arg Ile Leu Val Leu Glu Lys Gly
1430 1435 1440

Glu Val Ala Glu Phe Asp Thr Pro Trp Thr Leu Phe Ser Gln Glu
1445 1450 1455

Asp Ser Ile Phe Arg Ser Met Cys Ser Arg Ser Gly Ile Val Glu
1460 1465 1470

Asn Asp Phe Glu Asn Arg Ser
1475

SEQ ID NO : 11 
SEQUENCE LENGTH : 26 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : single 
TOPOLOGY : linear 

MOLECULE TYPE : Other nucleic acid(synthetic DNA)
SEQUENCE DESCRIPTION :
TTTGGTTAYA TGAAYYTNTT YGGNGT 26

SEQ ID NO : 12 
SEQUENCE LENGTH : 29 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : single 
TOPOLOGY : linear 

MOLECULE TYPE : Other nucleic acid(synthetic DNA) 

SEQUENCE DESCRIPTION : 

TCTACAAART ARTGGTGNGT NARRTACAT 29 

SEQ ID NO : 13 
SEQUENCE LENGTH : 2274 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : double 
TOPOLOGY : linear 
MOLECULE TYPE : genomic DNA 
SEQUENCE DESCRIPTION : 

TTATATATAT TATTGATTTG TTCCTGTTGT TATTTAGTTT AGAATCAGAC GACTACACCA 60 
GAACCACAAT TCAACCAACA CTTATATAGA ACCTGGCTTG GAAAAAAGTA ACATTTATCA 120 
TTCCTATACT TTTTTAGCAA ACATAATCCG TGTTTTACAT ATATTATTCA CCCAATATCA 180 
TAACAAAAAC AAACTGAATA ATGGCGTCTT CTATTTTGCG TTCCAAAATA ATACAAAAAC 240 
CGTACCAATT ATTCCACTAC TATTTTCTTC TGGAGAAGGC TCCTGGTTCT ACAGTTAGTG 300 
ATTTGAATTT TGATACAAAC ATACAAACGA GTTTACGTAA ATTAAAGCAT CATCATTGGA 360
CGGTGGGAGA AATATTCCAT TATGGGTTTT TGGTTTCCAT ACTTTTTTTC GTGTTTGTGG 420 
TTTTCCCAGC TTCATTTTTT ATAAAATTAC CAATAATCTT AGCATTTGCT ACTTGTTTTT 480 
TAATACCCTT AACATCACAA TTTTTTCTTC CTGCCTTGCC CGTTTTCACT TGGTTGGCAT 540 
TATATTTTAC GTGTGCTAAA ATACCTCAAG AATGGAAACC AGCTATCACA GTTAAAGTTT 600 
TACCAGCTAT GGAAACAATT TTGTACGGCG ATAATTTATC AAATGTTTTG GCAACCATCA 660
CTACCGGAGT GTTAGATATA TTGGCATGGT TACCATATGG GATTATTCAT TTCAGTTTCC 720
CATTTGTACT TGCTGCTATT ATATTTTTAT TTGGGCCACC GACGGCATTA AGATCATTTG 780 
GATTTGCCTT TGGTTATATG AACTTGCTTG GAGTCTTGAT TCAAATGGCA TTCCCACCTC 840 
CTCCTCCATG GTACAAAAAC TTGCACGGAT TAGAACCAGC TAATTATTCA ATGCACGGGT 900 
CTCCTGGTGG ACTTGGAAGG ATAGATAAAT TGTTAGGTGT TGATATGTAT ACCACAGGGT 960 
TTTCCAATTC ATCAATCATT TTTGGGGCAT TCCCATCGTT ACATTCAGGA TGTTGTATCA 1020 
TGGAAGTGTT ATTTTTGTGT TGGTTGTTTC CACGATTCAA GTTTGTGTGG GTTACATACG 1080 
CATCTTGGCT TTGCTGGAGC ACGATGTATT TGACCCATCA CTACTTTGTC GATTTGATTG 1140 
GTGGAGCCAT GCTATCTTTG ACTGTTTTTG AGTTCACCAA ATATAAATAT TTGCCAAAAA 1200 
ACAAAGAAGG CCTTTTCTGT CGTTGGTCAT ACACTGAAAT TGAAAAAATC GATATCCAAG 1260 
AGATTGACCC TTTATCATAC AATTATATCC CTGTCAACAG CAATGATAAT GAAAGCAGAT 1320 
TGTATACGAG AGTGTACCAA GAGTCTCAGG TTAGTCCCCC ACAGAGAGCT GAAACACCTG 1380 
AAGCATTTGA GATGTCAAAT TTTTCTAGGT CTAGACAAAG CTCAAAGACT CAGGTTCCAT 1440 
TGAGTAATCT TACTAACAAT GATCAAGTGT CTGGAATTAA CGAAGAGGAT GAAGAAGAAG 1500 
AAGGCGATGA AATTTCATCG AGTACTCCTT CGGTGTTTGA AGACGAACCA CAGGGTAGCA 1560 
CATATGCTGC ATCCTCAGCT ACATCAGTAG ATGATTTGGA TTCCAAAAGA AATTAGTAAA 1620 
ATAACAGTTT CTATTAATTT CTTTATTTCC TCCTAATTAA TGATTTTATG CTCAATACCT 1680
ACACTATCTG TTTTTAATTT CCTACTTTTT TTTTATTATT GTTGAGTTCA TTTGCTGTTC 1740 
ATTGAATATT TACAATTTTG CATTAATTAC CATCAATATA GAATGGGCAC AGTTTTTTTA 1800 
AGTTTTTTTG TTTTTGTGTT TGTCTTTCTT TTTTTACATT AATGTGTTTG GATTGTTTTA 1860 
GGTTCCTTTA TCCCTTAGCC CCCTCAGAAT ACTATTTTAT CTAATTAATT TGTTTTTATT 1920 
TTCTGATATT TACCAATTGC TTTTTCTTTT GGATATTTAT AATAGCATCC CCTAATAATT 1980 
AATATACAAC TGTTTCATAT ATATACGTGT ATGTCCTGTA GTGGTGGAAA CTGGAGTCAA 2040 
CATTTGTATT AATGTGTACA AGAAAGCAGT GTTAATGCTA CTATTATAAT TTTTGAGGTG 2100 
CAAATCAAGA GGTTGGCAGC TTTCTTATGG CTATGACCGT GAATGAAGGC TTGTAAACCA 2160 
CGTAATAAAC AAAAGCCAAC AAGTTTTTTT AGAGCCTTTA ACAACATACG CAATGAGAGT 2220 
GATTGCAATA CTACAAGATA TAGCCCAAAA AATTGAATGC ATTTCAACAA CAAC 2274 

SEQ ID NO : 14
SEQUENCE LENGTH : 471
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single 
TOPOLOGY : linear 
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Met Ala Ser Ser Ile Leu Arg Ser Lys Ile Ile Gln Lys Pro Tyr
5 10 15

Gln Leu Phe His Tyr Tyr Phe Leu Ser Glu Lys Ala Pro Gly Ser
20 25 30

Thr Val Ser Asp Leu Asn Phe Asp Thr Asn Ile Gln Thr Ser Leu
35 40 45

Arg Lys Leu Lys His His His Trp Thr Val Gly Glu Ile Phe His
50 55 60

Tyr Gly Phe Leu Val Ser Ile Leu Phe Phe Val Phe Val Val Phe
65 70 75

Pro Ala Ser Phe Phe Ile Lys Leu Pro Ile Ile Leu Ala Phe Ala
80 85 90

Thr Cys Phe Leu Ile Pro Leu Thr Ser Gln Phe Phe Leu Pro Ala
95 100 105

Leu Pro Val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala Lys
110 115 120

Ile Pro Gln Glu Trp Lys Pro Ala Ile Thr Val Lys Val Leu Pro
125 130 135

Ala Met Glu Thr Ile Leu Tyr Gly Asp Asn Leu Ser Asn Val Leu
140 145 150

Ala Thr Ile Thr Thr Gly Val Leu Asp Ile Leu Ala Trp Leu Pro
155 160 165

Tyr Gly Ile Ile His Phe Ser Phe Pro Phe Val Leu Ala Ala Ile
170 175 180

Ile Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg Ser Phe Gly Phe
185 190 195

Ala Phe Gly Tyr Met Asn Leu Leu Gly Val Leu Ile Gln Met Ala
200 205 210

Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu Glu
215 220 225

Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gly Gly Leu Gly Arg
230 235 240

Ile Asp Lys Leu Leu Gly Val Asp Met Tyr Thr Thr Gly Phe Ser
245 250 255

Asn Ser Ser Ile Ile Phe Gly Ala Phe Pro Ser Leu His Ser Gly
260 265 270

Cys Cys Ile Met Glu Val Leu Phe Leu Cys Trp Leu Phe Pro Arg
275 280 285

Phe Lys Phe Val Trp Val Thr Tyr Ala Ser Trp Leu Trp Trp Ser
290 295 300

Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Ile Gly Gly
305 310 315

Ala Met Leu Ser Leu Thr Val Phe Glu Phe Thr Lys Tyr Lys Tyr
320 325 330

Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr Thr
335 340 345

Glu Ile Glu Lys Ile Asp Ile Gln Glu Ile Asp Pro Leu Ser Tyr
350 355 360

Asn Tyr Ile Pro Val Asn Ser Asn Asp Asn Glu Ser Arg Leu Tyr
365 370 375

Thr Arg Val Tyr Gln Glu Ser Gln Val Ser Pro Pro Gln Arg Ala
380 385 390

Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe Ser Arg Ser Arg
395 400 405

Gln Ser Ser Lys Thr Gln Val Pro Leu Ser Asn Leu Thr Asn Asn
410 415 420

Asp Gln Val Ser Gly Ile Asn Glu Glu Asp Glu Glu Glu Glu Gly
425 430 435

Asp Glu Ile Ser Ser Ser Thr Pro Ser Val Phe Glu Asp Glu Pro
440 445 450

Gln Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser Val Asp Asp
455 460 465

Leu Asp Ser Lys Arg Asn
470

SEQ ID NO : 15
SEQUENCE LENGTH : 243
SEQUENCE TYPE : nucleic acid
STRANDEDNESS : double
TOPOLOGY : linear
MOLECULE TYPE : genomic DNA
SEQUENCE DESCRIPTION :

TTTGAAAAAT TTGAATTTTA AAATTAATCC AATGGAAAAA ATTGGTATTT GTCGAAGAAC 60
CGGTGCTGGT AAATCATCAA TTATGACAGC ATTATATCGA TTATCAGAAT TAGAACTGGG 120
GAAAATTATT ATTGATGATA TTGATATTTC AACTTTGGGT TTAAAAGATC TTCGATCAAA 180
ATTATCAATT ATTCCTCAAG ATCCAGTATT ATTCCGAGGT TCAATTCGGA AAAACTTGGA 240
TCC 243

SEQ ID NO : 16 
SEQUENCE LENGTH : 80 
SEQUENCE TYPE : amino acid
STRANDEDNESS : single
TOPOLOGY : linear
MOLECULE TYPE : peptide
SEQUENCE DESCRIPTION :

Leu Lys Asn Leu Asn Phe Lys Ile Asn Pro Met Glu Lys Ile Gly
5 10 15

Ile Cys Gly Arg Thr Gly Ala Gly Lys Ser Ser Ile Met Thr Ala
20 25 30

Leu Tyr Arg Leu Ser Glu Leu Glu Leu Gly Lys Ile Ile Ile Asp
35 40 45

Asp Ile Asp Ile Ser Thr Leu Gly Leu Lys Asp Leu Arg Ser Lys
50 55 60

Leu Ser Ile Ile Pro Gln Asp Pro Val Leu Phe Arg Gly Ser Ile
65 70 75

Arg Lys Asn Leu Asp
80
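
The relationship between SEQ ID NO : 15 and SEQ ID NO : 16 can be checked by translating the nucleotide listing. The following is only a minimal sketch under stated assumptions: it uses the standard genetic code, and it assumes the reading frame begins at the second nucleotide of SEQ ID NO : 15 (an inference from comparing the two listings, not something the listing states). The names `translate`, `CODON_TABLE` and `THREE_LETTER` are hypothetical helpers introduced for illustration, and only the first 46 nucleotides are translated.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the sequence listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}


def translate(dna, offset=0):
    """Translate a DNA string codon by codon, starting at a 0-based offset.

    Spaces (the ten-base grouping of the listing) are ignored, and any
    incomplete codon at the end is dropped.
    """
    dna = dna.replace(" ", "").upper()
    codons = [dna[i:i + 3] for i in range(offset, len(dna) - 2, 3)]
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons]


# First 46 nucleotides of SEQ ID NO : 15, read from the second nucleotide.
start_of_seq15 = "TTTGAAAAAT TTGAATTTTA AAATTAATCC AATGGAAAAA ATTGGT"
print(" ".join(translate(start_of_seq15, offset=1)))
```

Under these assumptions the printed residues correspond to positions 1 to 15 of SEQ ID NO : 16 (Leu Lys Asn Leu Asn Phe Lys Ile Asn Pro Met Glu Lys Ile Gly).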

SEQ ID NO : 17 
SEQUENCE LENGTH : 1601 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : double 
TOPOLOGY : linear

MOLECULE TYPE : genomic DNA 
ANTI-SENSE : Yes 
SEQUENCE DESCRIPTION :

AGGAAGATGA CTTGCATCAA AGATGGAGGA AGTGGTACTG GCAGGACGAT CAATCAAATC 60 
AGCAGCAGGA CTAGGTAACG GCTCAGGTGA TGATGAACCC ACGGACCATT CATGATCGGT 120 
GTTAGCAAGT TCCATATTGT TAAGACCACT CATGAAGGCT ACTGCATTAG GGTTTTGAGT 180

AAAAGAATCC CTTCCAAGTA AGTATGGGCT GCCGGTACGA GCCAAGGAGT TGCTGGTTTT 240 
TTCGGAAAGA CCATGACCGT GGATAACAAA CTCGTATTCC CAACGAAGGA TTTTACCAGT 300 
TTGCAACTGT GGGAGGCGTA GCTTTTGAGC AAAAACGAAG CATATAATAG CTAAACACAT 360

ACCGCCGACC AAATCTACAA AGTAGTGGTC GGTAAGGTAC ATAGTACACC AGCAAAGCCA 420

TAGAACATAT CCATAAAAGC AGAAGCGGTA TCGAGGAAAC ACATGCGAAA GGAAAAGTGC 480 

TTCCAGCATG GCCCATCCAG CGTGAAGAGA TGGAAAGGCA CCAAAAACAA CCGGAGAGTT 540 

AGAAAAACCA TCAGTGTAAA TGCTAGTGCC GAAGAGAGCA TCAATACGGG CCAATCCACC 600 

AGGAGAGCCA CGTACTGCAT ACGTGGCAGG TTCTAAACCA TACATATTTT CATACCAAGG 660

AGGAGAACAG GGGAAAGCCA TTTGGATAAG AACACCAAAT AAATTCATAT AACCAAAAGT 720 

TCGAGCCCAA ACTGGAAGAG TTCCAGGAGG TGCAAAGATG AAAAGAATAA ATGAAATGAT 780 

AAAAGGAGCC GAATAATGCA TGACTCCATA TGGAACCCAG GCCAAAATAT CAAGGATGCT 840 

ATGCGTGGTT TTCGAGAGAA GACTAGAAAG ATTAGAGCCA TAAAGAATAT TTTCAAGTGT 900 

GGGTAAAACA CGAACCCATA TGGGTGGACG CCAGCGTTCT GGAATAAACC TACAAGAGTA 960 

AAATAAAATT GCCCAGGTGA TGATAACAAT GGCAGGAAAA AAAATTTGGC GTGTTAAAGG 1020 

AACGGTCAAC GCAATGGCCA AAAGACAGGC AATGCCAAAT TTCCCCCAGA ATCCAGGAGA 1080 

TTCAATGACA ATACAAGCAA AAATCAAATT ACCTGCTAGA AACACATATT GCAAATGTGT 1140 

CCATGACCAT TTCGTATTGC GTAGCAAACG AAATGTAGGC ATAGGGTTTA AGCTTGTTTC 1200

CAACTTGTAT TGGGATGCTC GGTTACACGC AGCAAGGCGC TTTTTTAAGG TCGAAAGAGC 1260 

AGACATTGCT TCAAAGAATT ATCAGAGTAA AAAAGGGAAG CGTACGAAAA AAATTTCGTA 1320 

AGGAATTAAC CGGAAAACTA AAGGAAAAAA AAGGAATTTT TATGAAGGAA AGAAAGTAGC 1380

TATTAAATGC AAGTGTCAAG CACTTAAAAG TAGCGATGTA AAATATTTAA AAAAAGATGG 1440 

ACCGATTAAC CAATGTTCAG CTCACAGTTG CCAGCAATCA GGGCTATTTT TTTATTTTTT 1500 

TTATAAAATT GCTAATTATA TATAATATAA TTAGTTTATT AACTTGCTTT TCCTCAAAAA 1560 

ACCAATTCGA GAAAGGAACT TTTGCAGAGG CAAAAAAGCT T 1601

SEQ ID NO : 18
SEQUENCE LENGTH : 1601 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : double 
TOPOLOGY : linear 
MOLECULE TYPE : mRNA
ANTI-SENSE : Yes
SEQUENCE DESCRIPTION :
AGGAAGAUGA CUUCCAUCAA AGAUGGAGGA AGUGGUACUG GCAGGACGAU CAAUCAAAUC 60
AGCAGCAGGA CUAGGUAACG GCUCAGGUGA UGAUGAACCC ACGGACCAUU CAUGAUCGGU 120
GUUAGCAAGU UCCAUAUUGU UAAGACCACU CAUGAAGGCU ACUGCAUUAG GGUUUUGAGU 180
AAAAGAAUCC CUUCCAAGUA AGUAUGGGCU GCCGGUACGA GCCAAGGAGU UGCUGGUUUU 240
UUCGGAAAGA CCAUGACCGU GGAUAACAAA CUCGUAUUCC CAACGAAGGA UUUUACCAGU 300
UUGCAACUGU GGGAGGCGUA GCUUUUGAGC AAAAACGAAG CAUAUAAUAG CUAAACACAU 360
ACCGCCGACC AAAUCUACAA AGUAGUGGUG GGUAAGGUAC AUAGUACACC AGCAAAGCCA 420
UAGAACAUAU CCAUAAAAGC AGAAGCGGUA UCGAGGAAAC ACAUGCGAAA GGAAAAGUGC 480
UUCCAGCAUG GCCCAUCCAG CGUGAAGAGA UGGAAAGGCA CCAAAAACAA CCGGAGAGUU 540
AGAAAAACCA UCAGUGUAAA UGCUAGUGCC GAAGAGAGCA UCAAUACGGG CCAAUCCACC 600
AGGAGAGCCA CGUACUGCAU ACGUGGCAGG UUCUAAACCA UACAUAUUUU CAUACCAACG 660
AGGAGAACAG GGGAAAGCCA UUUGGAUAAG AACACCAAAU AAAUUCAUAU AACCAAAACU 720
UCGAGCCCAA ACUGGAAGAG UUCCAGGAGG UGCAAAGAUG AAAAGAAUAA AUGAAAUGAU 780
AAAAGGAGCC GAAUAAUGCA UGACUCCAUA UGGAACCCAG GCCAAAAUAU CAAGGAUGCU 840
AUGCGUGGUU UUCGAGAGAA GACUAGAAAG AUUAGAGCCA UAAACAAUAU UUUCAAGUGU 900
GGGUAAAACA CGAACCCAUA UGGGUGGACG CCAGCGUUCU GGAAUAAACC UACAAGAGUA 960
AAAUAAAAUU GCCCAGGUGA UGAUAACAAU GGCAGGAAAA AAAAUUUGGC GUGUUAAAGG 1020
AACGGUCAAC GCAAUGGCCA AAAGACAGGC AAUGCCAAAU UUCCCCCAGA AUCCAGGAGA 1080
UUCAAUGACA AUACAAGCAA AAAUCAAAUU ACCUGCUAGA AACACAUAUU GCAAAUGUGU 1140
CCAUGACCAU UUCGUAUUGC GUAGCAAACG AAAUGUAGGC AUAGGGUUUA AGCUUGUUUC 1200
CAACUUGUAU UGGGAUGCUC GGUUACACGC AGCAAGGCGC UUUUUUAAGG UCGAAAGAGC 1260
AGACAUUGCU UCAAAGAAUU AUCAGAGUAA AAAAGGGAAG CGUACGAAAA AAAUUUCGUA 1320
AGGAAUUAAC CGGAAAACUA AAGGAAAAAA AAGGAAUUUU UAUGAAGGAA AGAAAGUAGC 1380
UAUUAAAUGC AAGUGUCAAG CACUUAAAAG UAGCGAUGUA AAAUAUUUAA AAAAAGAUGG 1440
ACCGAUUAAC CAAUGUUCAG CUCACAGUUG CCAGCAAUCA GGGCUAUUUU UUUAUUUUUU 1500
UUAUAAAAUU GCUAAUUAUA UAUAAUAUAA UUAGUUUAUU AACUUGCUUU UCCUCAAAAA 1560
ACCAAUUCGA GAAAGGAACU UUUGCAGAGG CAAAAAAGCU U 1601
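
SEQ ID NO : 18 is SEQ ID NO : 17 written in RNA notation, in the same antisense orientation. The sketch below is illustrative only: `transcribe` and `reverse_complement` are hypothetical helper names, and the assumption that the sense strand is the reverse complement of the listed antisense strand follows from the "ANTI-SENSE : Yes" annotation rather than from any explicit statement in the listing. Only the first ten-base block and the last eleven bases of SEQ ID NO : 17 are used.

```python
def transcribe(dna):
    """Rewrite a DNA strand in RNA notation (T -> U); orientation is unchanged."""
    return dna.replace(" ", "").upper().replace("T", "U")


def reverse_complement(dna):
    """Return the complementary DNA strand, read 5' to 3'."""
    complement = str.maketrans("ACGT", "TGCA")
    return dna.replace(" ", "").upper().translate(complement)[::-1]


# First ten-base block of SEQ ID NO : 17 in RNA notation.
print(transcribe("AGGAAGATGA"))

# Last eleven bases of SEQ ID NO : 17; under the assumption above, their
# reverse complement is the 5' end of the corresponding sense strand.
print(reverse_complement("CAAAAAAGCT T"))
```

The first call yields AGGAAGAUGA, the first block of SEQ ID NO : 18. The second yields AAGCTTTTTTG, which begins with the palindromic sequence AAGCTT.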

SEQ ID NO : 19
SEQUENCE LENGTH : 12 
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single 
TOPOLOGY : linear 
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Cys Phe Thr Ser Ser Tyr Phe Pro Asp Asp Arg Arg 
5 10 

SEQ ID NO : 20 
SEQUENCE LENGTH : 19
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single 
TOPOLOGY : linear 
MOLECULE TYPE : peptide 
SEQUENCE DESCRIPTION : 

Cys Tyr Thr Ser Ile Glu Lys Tyr Asp Ile Ser Lys Ser Asp Pro
5 10 15 

Leu Ala Ala Asp 

SEQ ID NO : 21 
SEQUENCE LENGTH : 1553 
SEQUENCE TYPE : nucleic acid 
STRANDEDNESS : double 
TOPOLOGY : linear 
MOLECULE TYPE : genomic DNA
SEQUENCE DESCRIPTION :
TTTTACATAT ATTATTCACC CAATATCATA ACAAAAACAA ACTGAATGAT GGCATCTTCT 60 
ATTTTGCGTT CCAAAATAAT ACAAAAACCG TACCAATTAT TCCACTACTA TTTTCTTCTG 120
GAGAACCCTC CTGGTTCTAC AGTTAGTGAT TTGAATTTTG ATACAAACAT ACAAACGAGT 180

TTACGTAAAT TAAAGCATCA TCATTGGACG GTGGGAGAAA TATTCCATTA TGGGTTTTTG 240 

GTTTCCATAC TTTTTTTCGT GTTTGTGGTT TTCCCAGCTT CATTTTTTAT AAAATTACCA 300 

ATAATCTTAG CATTTGCTAC TTGTTTTTTA ATACCCTTAA CATCACAATT TTTTCTTCCT 360 

GCCTTGCCCG TTTTCACTTG GTTGGCATTA TATTTTACGT CTGCTAAAAT ACCTCAAGAA 420 

TGGAAACCAG CTATCACAGT TAAAGTTTTA CCAGCTATGG AAACAATTTT GTACGGCGAT 480 

AATTTATCAA ATGTTTTGGC AACCATCACT ACCGGAGTGT TAGATATATT GGCATGGTTA 540 

CCATATGGGA TTATTCATTT CAGTTTCCCA TTTGTACTTG CTGCTATTAT ATTTTTATTT 600 

GGGCCACCGA CGGCATTAAG ATCATTTGGA TTTGCCTTTG GTTATATGAA CTTGCTTGGA 660 

GTCTTGATTC AAATGGCATT CCCAGCTGCT CCTCCATGGT ACAAAAACTT GCACGGATTA 720 

GAACCAGCTA ATTATTCAAT GCACGGGTCT CCTGGTGGAC TTGGAAGGAT AGATAAATTG 780 

TTAGGTGTTG ATATGTATAC CACAGGGTTT TCCAATTCAT CAATCATTTT TGGGGCATTC 840 

CCATCGTTAC ATTCAGGATG TTGTATCATG GAAGTGTTAT TTTTGTGTTG GTTGTTTCCA 900

CGATTCAAGT TTGTGTGGGT TACATACGCA TCTTGGCTTT GGTGGAGCAC GATGTATTTG 960 

ACCCATCACT ACTTTGTCGA TTTGATTGGT GGAGCCATGC TATCTTTGAC TGTTTTTGAA 1020 

TTCACCAAAT ATAAATATTT GCCAAAAAAC AAAGAAGGCC TTTTCTGTCG TTGGTCATAC 1080 

ACTGAAATTG AAAAAATCGA TATCCAAGAG ATTGACCCTT TATCATACAA TTATATCCCT 1140 

GTCAACAGCA ATGATAATGA AAGCAGATTG TATACGAGAG TGTACCAAGA GCCTCAGGTT 1200 

AGTCCCCCAC AGAGAGCTGA AACACCTGAA GCATTTGAGA TGTCAAATTT TTCTAGGTCT 1260 

AGACAAACCT CAAAGACTCA GGTTCCATTG AGTAATCTTA CTAACAATGA TCAAGTGCCT 1320 

GGAATTAACG AAGAGGATGA AGAAGAAGAA GGCGATGAAA TTTCGTCGAG TACTCCTTCG 1380 

GTGTTTGAAG ACGAACCACA GGGTAGCACA TATGCTGCAT CCTCAGCTAC ATCAGTAGAT 1440 

GATTTGGATT CCAAAAGAAA TTAGTAAAAC AGCAGTTTCT ATTAATTTCT TTATTTCCTC 1500 

CTAATTAATG ATTTTATGTT CAATACCTAC ACTATCTGTT TTTAATTTCC TAC 1553 

SEQ ID NO : 22 
SEQUENCE LENGTH : 472 
SEQUENCE TYPE : amino acid 
STRANDEDNESS : single
TOPOLOGY : linear
MOLECULE TYPE : peptide
SEQUENCE DESCRIPTION :

Met Met Ala Ser Ser Ile Leu Arg Ser Lys Ile Ile Gln Lys Pro
1 5 10 15

Tyr Gln Leu Phe His Tyr Tyr Phe Leu Leu Glu Lys Ala Pro Gly
20 25 30

Ser Thr Val Ser Asp Leu Asn Phe Asp Thr Asn Ile Gln Thr Ser
35 40 45

Leu Arg Lys Leu Lys His His His Trp Thr Val Gly Glu Ile Phe
50 55 60

His Tyr Gly Phe Leu Val Ser Ile Leu Phe Phe Val Phe Val Val
65 70 75

Phe Pro Ala Ser Phe Phe Ile Lys Leu Pro Ile Ile Leu Ala Phe
80 85 90

Ala Thr Cys Phe Leu Ile Pro Leu Thr Ser Gln Phe Phe Leu Pro
95 100 105

Ala Leu Pro Val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala
110 115 120

Lys Ile Pro Gln Glu Trp Lys Pro Ala Ile Thr Val Lys Val Leu
125 130 135

Pro Ala Met Glu Thr Ile Leu Tyr Gly Asp Asn Leu Ser Asn Val
140 145 150

Leu Ala Thr Ile Thr Thr Gly Val Leu Asp Ile Leu Ala Trp Leu
155 160 165

Pro Tyr Gly Ile Ile His Phe Ser Phe Pro Phe Val Leu Ala Ala
170 175 180

Ile Ile Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg Ser Phe Gly
185 190 195

Phe Ala Phe Gly Tyr Met Asn Leu Leu Gly Val Leu Ile Gln Met
200 205 210

Ala Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu
215 220 225

Glu Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gly Gly Leu Gly
230 235 240

Arg Ile Asp Lys Leu Leu Gly Val Asp Met Tyr Thr Thr Gly Phe
245 250 255

Ser Asn Ser Ser Ile Ile Phe Gly Ala Phe Pro Ser Leu His Ser
260 265 270

Gly Cys Cys Ile Met Glu Val Leu Phe Leu Cys Trp Leu Phe Pro
275 280 285

Arg Phe Lys Phe Val Trp Val Thr Tyr Ala Ser Trp Leu Trp Trp
290 295 300

Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Ile Gly
305 310 315

Gly Ala Met Leu Ser Leu Thr Val Phe Glu Phe Thr Lys Tyr Lys
320 325 330

Tyr Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr
335 340 345

Thr Glu Ile Glu Lys Ile Asp Ile Gln Glu Ile Asp Pro Leu Ser
350 355 360

Tyr Asn Tyr Ile Pro Val Asn Ser Asn Asp Asn Glu Ser Arg Leu
365 370 375

Tyr Thr Arg Val Tyr Gln Glu Pro Gln Val Ser Pro Pro Gln Arg
380 385 390

Ala Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe Ser Arg Ser
395 400 405

Arg Gln Ser Ser Lys Thr Gln Val Pro Leu Ser Asn Leu Thr Asn
410 415 420

Asn Asp Gln Val Pro Gly Ile Asn Glu Glu Asp Glu Glu Glu Glu
425 430 435

Gly Asp Glu Ile Ser Ser Ser Thr Pro Ser Val Phe Glu Asp Glu
440 445 450

Pro Gln Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser Val Asp
455 460 465

Asp Leu Asp Ser Lys Arg Asn
470

Claims

1. An isolated gene coding for a protein which regulates aureobasidin sensitivity.

2. An isolated gene as claimed in Claim 1 which is contained in a DNA fragment represented by a
restriction enzyme map as specified in any of Fig. 1 to Fig. 3.

3. An isolated gene as claimed in Claim 1 which is hybridizable with a gene of Claim 2.

4. A process for cloning a gene of Claim 1 characterized by using a gene of Claim 2 or 3 or a part thereof
as a probe.

5. A nucleic acid probe which comprises a sequence consisting of 15 or more bases and is hybridizable
with a gene of Claim 1.

6. An antisense DNA of a gene which codes for a protein regulating aureobasidin sensitivity.

7. An antisense RNA of a gene which codes for a protein regulating aureobasidin sensitivity.

8. A recombinant plasmid containing a gene of Claim 1.

9. A transformant having a recombinant plasmid of Claim 8 introduced thereinto.

10. A process for producing a protein regulating aureobasidin sensitivity characterized by culturing a
transformant of Claim 9 and collecting the protein regulating aureobasidin sensitivity from the culture.

11. An isolated protein regulating aureobasidin sensitivity which is encoded by a gene of Claim 1.

12. An antibody against a protein of Claim 11.

13. A process for detecting a protein regulating aureobasidin sensitivity which comprises using an antibody
of Claim 12.

14. A process for detecting a gene coding for a protein regulating aureobasidin sensitivity which comprises
the hybridization with the use of a nucleic acid probe of Claim 5.

15. A process for screening an antimycotic which comprises using a transformant of Claim 9 or a protein of
Claim 11.

Fig. 4
Fig. 6
Fig. 8
Fig. 10
Fig. 11